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TITLE OF THE INVENTION 

ASSAYS TO MONITOR AMYLOID PRECURSOR PROTEIN PROCESSING 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 This application claims the benefit of U.S. Provisional Application No. 

60/360,274, filed February 27, 2002, the contents of which are incorporated herein by 
reference in their entirety. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
10 Not applicable. 

REFERENCE TO MICROFICHE APPENDIX 
Not applicable. 

15 FIELD OF THE INVENTION 

The present invention is directed to the field of Alzheimer's disease. 
In particular, the present invention provides novel methods of identifying substances 
that are specific inhibitors of various steps in the processing of amyloid precursor 
protein. 

20 

BACKGROUND OF THE INVENTION 

Alzheimer's disease is a common, chronic neurodegenerative disease, 
characterized by a progressive loss of memory and sometimes severe behavioral 
abnormalities, as well as an impairment of other cognitive functions that often leads to 

25 dementia and death. It ranks as the fourth leading cause of death in industrialized 

societies after heart disease, cancer, and stroke. The incidence of Alzheimer's disease 
is high, with an estimated 2.5 to 4 million patients affected in the United States and 
perhaps 17 to 25 million worldwide. Moreover, the number of sufferers is expected to 
grow as the population ages. 

30 A characteristic feature of Alzheimer's disease is the presence of large 

numbers of insoluble deposits, known as amyloid plaques, in the brains of those 
affected. Autopsies have shown that amyloid plaques are found in the brains of 
virtually all Alzheimer's patients and that the degree of amyloid plaque deposition 
correlates with the degree of dementia (Cummings & Cotman, 1995, Lancet 
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326:1524-1587). While some opinion holds that amyloid plaques are a late stage by- 
product of the disease process, the consensus view is that amyloid plaques are more 
likely to be intimately, and perhaps causally, involved in Alzheimer's disease. 

A variety of experimental evidence supports this view. For example, 
5 AP, a primary component of amyloid plaques, is toxic to neurons in culture and 

transgenic mice that overproduce A|3 in their brains show significant deposition of Ap 
into amyloid plaques as well as significant neuronal toxicity (Yankner, 1990, Science 
250:279-282; Mattson et al., 1992, J. Neurosci. 12:379-389; Games et al., 1995, 
Nature 373:523-527; LaFerla et al., 1995, Nature Genetics 9:21-29). Mutations in the 
10 APP gene, leading to increased A(3 production, have been linked to heritable forms of 
Alzheimer's disease (Goate et al., 1991, Nature 349:704-706; Chartier-Harlan et al., 

1991, Nature 353:844-846; Murrel et al., 1991,Science 254:97-99; Mullan et al., 

1992, Nature Genetics 1:345-347). Presenilin-1 (PS1) and presenilin-2 ( PS2) related 
familial early-onset Alzheimer's disease (FAD) shows disproportionately increased 

15 production of A|3l-42, the 42 amino acid isofonn of Aj3, as opposed to A|3l-40, the 
40 amino acid isoform (Scheuner et al, 1996, Nature Medicine 2:864-870). The 
longer isofonn of A(3 is more prone to aggregation than the shorter isoform (Jarrett et 
al, 1993, Biochemistry 32:4693-4697). Injection of the insoluble, fibrillar form of Ap 
into monkey brains results in the development of pathology (neuronal destruction, tau 

20 phosphorylation, microglial proliferation) that closely mimics Alzheimer's disease in 
humans (Geula et al., 1998, Nature Medicine 4:827-831). See Selkoe, 1994, J. 
Neuropathol. Exp. Neurol. 53:438-447 for a review of the evidence that amyloid 
plaques have a central role in Alzheimer's disease. 

A3, a 39-43 amino acid peptide derived by proteolytic cleavage of the 

25 amyloid precursor protein (APP), is the major component of amyloid plaques 

(Glenner & Wong, 1984, Biochem. Biophys. Res. Comm. 120:885-890). APP is 
actually a family of polypeptides produced by alternative splicing from a single gene. 
Major forms of APP are known as APP695, APP751, and APP770, with the 

subscripts referring to the number of amino acids in each splice variant (Ponte et al., 
30 1988, Nature 331:525-527; Tanzi et al., 1988, Nature 331:528-530; Kitaguchi et al., 
1988, Nature 331:530-532). APP is membrane bound and undergoes proteolytic 
cleavage by at least two pathways. In one pathway, cleavage by an enzyme known as 
a-secretase occurs while APP is still in the trans-Golgi secretory compartment 
(Kuentzel et al., 1993, Biochem. J. 295:367-378). This cleavage by a-secretase 
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occurs within the AP portion of APP, thus precluding the formation of Ap. In another 
proteolytic pathway, cleavage of the Met596-Asp597 bond (numbered according to 

the 695 amino acid protein) by an enzyme known as P-secretase occurs. This 
cleavage by P-secretase generates the N-terminus of Ap. The C-terminus is formed 
5 by cleavage by a second enzyme known as y-secretase. The C-terminus is actually a 
heterogeneous collection of cleavage sites rather than a single site since y-secretase 
activity occurs over a short stretch of APP amino acids rather than at a single peptide 
bond. Peptides of 40 or 42 amino acids in length (Api-40 and Api-42, respectively) 
predominate among the C-termini generated by y-secretase. Api-42 is more prone to 

10 aggregation than Api-40, is the major component of amyloid plaque (Jarrett et al., 
1993, Biochemistry 32:4693-4697; Kuo et al., 1996, J. Biol. Chem. 271:4077-4081), 
and its production is closely associated with the development of Alzheimer's disease 
(Sinha & lieberburg, 1999, Proc. Natl. Acad. Sci. USA 96:11049-11053). The bond 
cleaved by y-secretase appears to be situated within the transmembrane domain of 

15 APP. It is unclear as to whether the C-termini of Api-40 and Apl-42 are generated 
by a single y-secretase protease with sloppy specificity or by two distinct proteases. 
For a review that discusses APP and its processing, see Selkoe, 1998, Trends Cell. 
Biol 8:447-453. 

Much interest has focused on the possibility of inhibiting the 

20 development of amyloid plaques as a means of preventing or ameliorating the 

symptoms of Alzheimer's disease. To that end, a promising strategy is to inhibit the 
activity of p- and y-secretase, the two enzymes that together are responsible for 
producing Ap. This strategy is attractive because, if the formation of amyloid plaques 
as a result of the deposition of Ap is a cause of Alzheimer's disease, inhibiting the 

25 activity of one or both of the two secretases would intervene in the disease process at 
an early stage, before late-stage events such as inflammation or apoptosis occur. Such 
early stage intervention is expected to be particularly beneficial (see, e.g., Citron, 
2000, Molecular Medicine Today 6:392-397). 

To that end, various assays have been developed that are directed to the 

30 identification of compounds that may interfere with the production of AP or its 

deposition into amyloid plaques. U.S. Patent No. 5,441,870 is directed to methods of 
monitoring the processing of APP by detecting the production of amino terminal 
fragments of APP. U.S. Patent No. 5,605,811 is directed to methods of identifying 
inhibitors of the production of amino terminal fragments of APP. U.S. Patent No. 
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5,593,846 is directed to methods of detecting soluble A|3 by the use of binding 
substances such as antibodies. Esler et aL, 1997, Nature Biotechnology 15:258-263 
described an assay that monitored the deposition of A0 from solution onto a synthetic 
analogue of an amyloid plaque. The assay was suitable for identifying compounds 
5 that could inhibit the deposition of Ap. However, this assay is not suitable for 

identifying substances, such as inhibitors of f3- or y-secretase, that would prevent the 
formation of Ap. 

Various groups have cloned and sequenced cDNA encoding a protein 
that is believed to be p-secretase (Vassar et al., 1999, Science 286:735-741; Hussain 

10 et al., 1999, Mol. Cell. Neurosci. 14:419-427; Yan et al., 1999, Nature 402:533-537; 
Sinha et al., 1999, Nature 402:537-540; lin et al., 2000, Proc. Natl. Acad. ScL USA 
97: 1456-1460) but the identity of y-secretase has been more elusive. A pair of 
proteins known as presenilin-1 and presenilin-2 are viewed as possible candidates 
(Selkoe & Wolfe, 2000, Proc. Natl. Acad. Sci. USA 97:5690-5692). 

15 Presenilin-1 (PS 1) and presenilin-2 (PS2) are polytopic membrane 

proteins that are involved in y-secretase-mediated processing of APP. The most 
common cause of familial early-onset Alzheimer's disease is the autosomal dominant 
inheritance of assorted mutations in the PS1 gene (Sherrington et al., 1995, Nature 
375:754-760). These PS1 mutations lead to increased production of Apl-42 

20 (Scheuner et al., 1996, Nature Medicine 2:864-870; Duff et al., 1996, Nature 
383:710-713; Borchelt et al., 1996, Neuron 17:1005-1013). Similarly, certain 
mutations in PS2 cause familial early-onset Alzheimer's disease and increased 
generation of AP42 (Levy-Lahad et al., 1995, Science 269:970-973). Cultured 
isolated neurons from PSl-deficient mice exhibit reduced y-secretase-mediated 

25 cleavage of APP (De Strooper et al., 1998, Nature 391:387-390). It was suggested 
that PS1 might influence trafficking of APP and/or y-secretase or it might play a more 
direct role in proteolytic cleavage of APP. Directed mutagenesis of two conserved 
transmembrane-situated aspartates in PS1 was shown to inactivate y-secretase activity 
in cellular assays, suggesting that PS1 is either a required diaspartyl cof actor for y- 

30 secretase or is itself y-secretase (Wolfe et al., 1999, Nature 398:513-517). Moreover, 
Li et al., 2000, Nature 405:689-694 made photoactivatable derivatives of a highly 
specific and potent aspartyl protease transition state analog inhibitor and found that 
the inhibitor selectively labeled presenilin fragments. 
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Co-immunoprecipitation experiments have shown that PS1 and PS2 
interact directly with the immature forms of APP in the endoplasmic reticulum where 
the disease-associated amyloid Af3l-42 peptide is probably generated (Xia et aL, 1997 
Proc. Nad. Acad. Sci. USA 94:8208-8213; Weidemann et aL, 1997, Nat. Med. 3:328- 
5 332). Knock-out of PS1 activity greatly diminishes y-secretase cleavage of APP (De 
Strooper et aL, 1998, Nature 391:387-390). PS1 knock-outs do not exhibit total lack 
of y-secretase activity but knock-out of both PS1 and PS 2 activity does result in a total 
loss of y-secretase activity (Herreman et aL, 2000, Nat. Cell. Biol. 2:461-462; Zhang 
et aL, 2000, Nat. Cell BioL 2:463-465), suggesting that PS2 has a similar function to 

10 PS 1 in the processing of APP. 

Karlstrom et aL, (Journal of Biological Chemistry papers in press, 
published on December 13, 2001 as Manuscript C100649200) describes an assay 
designed specifically to identify inhibitors of y-secretase cleavage of APP. The 
authors inserted the GAL4 DNA binding domain fused to the VP16 transactivation 

15 domain into C99, a portion of APP containing the 99 carboxy-terminal amino acids. 
This fragment of APP contains the y-secretase cleavage site but lacks the [5-secretase 
cleavage site. Transaction of a UAS reporter plasmid by GAL4-VP16 confirmed 
cleavage of the Gal4-VP16/C99 substrate by y-secretase only. Thus, the assay is 
capable of detecting y-secretase inhibitors but not inhibitors of (3-secretase or other 

20 modulators of APP processing requiring the N-terminal domain of APP. 

Cao & Stidhoff, 2001, Science 293:115-120 described work in which 
the GAL4 and LexA DNA binding domains were inserted into APP to demonstrate 
the potential of the cleaved C-terminus of APP for transcriptional co-activation. In 
this article, a transcriptional factor was not fused to APP and no attempt was made to 

25 develop an assay for the identification of APP processing inhibitors. 

Sisodia, 1992, Proc. Natl. Acad. Sci. USA 89:6975-6979 described 
various changes in the amino acid sequence of APP in the region of the a-secretase 
cleavage site and the effect of those changes on cleavage by a-secretase. A change of 
K to V at position 612 of the 695 amino acid version of APP led to reduced cleavage 

30 by a-secretase. 

U.S. Patent No. 6,333,167 Bl discloses an assay involving DNA 
constructs encoding portions of membrane proteins containing sites that are 
susceptible to cleavage by proteases that are fused to transcriptional repressors. Such 
constructs are introduced into cells that contain a reporter gene under the control of a 
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promoter that is sensitive to the repressor. In the absence of an inhibitor of the 
protease, the fusion protein is cleaved by the protease, releasing a membrane 
protein/repressor fusion protein that translocates to the nucleus and represses 
transcription from the reporter gene. In the presence of an inhibitor of the protease, 
5 the membrane protein/repressor fusion protein is not released and thus cannot repress 
transcription from the reporter. An increase in reporter expression can therefore be 
used as a readout for the presence of an inhibitor. 

SUMMARY OF THE INVENTION 

10 The present invention is directed to methods of identifying inhibitors of 

the processing of amyloid precursor protein (APP) that are capable of identifying 
inhibitors of a number of steps of such processing. Unlike prior methods, the methods 
of the present invention can be used to screen for inhibitors of f$-secretase cleavage, y- 
secretase cleavage, APP extracellular signaling, or APP cytoplasmic signaling in a 

15 single assay. 

The methods employ a recombinant eukaryotic cell that is capable of 
processing APP. The cell has been engineered to express a fusion protein that 
contains amino acid sequences encompassing both the P-secretase cleavage site of 
APP and the y-secretase cleavage site. The fusion protein also contains a transcription 

20 factor fused in frame to the APP sequences. 

When the recombinant cell is further engineered to contain a reporter 
gene, in which transcription of the reporter gene is driven by a regulatory DNA 
sequence that is inactive in the absence of the transcription factor but active in the 
transcription factor's presence, a system useful for screening for APP processing 

25 inhibitors is provided. Since the recombinant cell has been selected so as to be 
capable of processing APP, the fusion protein will be processed, releasing the 
transcription factor and activating transcription of the reporter gene. The reporter 
gene has been preselected so that activation of the reporter gene leads to a detectable 
phenotype. 

30 The system is utilized by exposing the recombinant cell to substances 

that are to be tested for the ability to inhibit APP processing. Those substances that 
are actually inhibitors of APP processing will cause diminished processing of the 
fusion protein, leading to smaller amounts of the transcription factor being released. 



-6- 



WO 03/072041 



PCT/US03/05458 



This leads to less transcription of the reporter gene. This results in a decrease in the 
phenotypic effect of the reporter gene that can be observed 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figure 1A-G shows a schematic diagram of several APP/transcription 

factor fusion constructs. 

Figure 2A-B shows the DNA sequence (SEQ ID NO:l) of the fusion 
protein APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695). 

Figure 3 shows the amino acid sequence (SEQ ID NO:2) of the fusion 
10 protein APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) with the various 

different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of p-secretase cleavage; 3 = K612V mutation; 4 = region of y-secretase 
cleavage; 5 = linker; 6 = TAT exon I; 7 = linker; 8 = amino acids 664-695 of APP. 

Figure 4A-B shows the DNA sequence (SEQ ID NO:3) of the fusion 
15 protein APP(l-65 l)wt, K612V-TATexonI(MlL) APP (664-695). 

Figure 5 shows the amino acid sequence (SEQ ID NO:4) of the fusion 
protein APP(l-651)wt, K612V-TATexonI(MlL) APP (664-695) with the various 
different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of P-secretase cleavage; 3 = K612V mutation; 4 = region of y-secretase 
20 cleavage; 5 = linker; 6 = TAT exon I; 7 = linker, 8 = amino acids 664-695 of APP. 

Figure 6A-C shows the DNA sequence (SEQ ID NO:5) of the fusion 
protein APP(1-651)SW, K612V, GAL4-VP16(delMet) APP (664-695). 

Figure 7 shows the amino acid sequence (SEQ ID NO: 6) of the fusion 
protein APP(1-651)SW, K612V, GAL4-VP16(delMet) APP (664-695) with the 
25 various different regions of the fusion protein demarcated. 1 = amino acids 1-651 of 
APP; 2 = region of (3-secretase cleavage; 3 = K612V mutation; 4 = region of y- 
secretase cleavage; 5 = linker; 6 = GAL4-VP16; 7 = linker; 8 = amino acids 664-695 
of APP. 

Figure 8A-C shows the DNA sequence (SEQ ID NO:7) of the fusion 
30 protein APP(l-651)wt, K612V, GAL4-VP16(del Met) APP (664-695). 

Figure 9 shows the amino acid sequence (SEQ ID NO: 8) of the fusion 
protein APP(l-651)wt, K612V, GAL4-VP16(del Met) APP (664-695) with the 
various different regions of the fusion protein demarcated. 1 = amino acids 1-651 of 
APP; 2 = region of (3-secretase cleavage; 3 = K612V mutation; 4 = region of y- 
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secretase cleavage; 5 = linker, 6 = GAL4-VP16; 7 = linker; 8 = amino acids 664-695 
of APR 

Figure 10A-B shows the DNA sequence (SEQ ID NO:9) of the fusion 
protein APP(1~651)SW, TATexonI(MlL) APP (664-695). 
5 Figure 1 1 shows the amino acid sequence (SEQ ID NO: 10) of the 

fusion protein APP(1-651)SW, TATexonI(MlL) APP (664-695) with the various 
different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of P-secretase cleavage; 3 = wild-type K at position 612; 4 = region of y- 
secretase cleavage; 5 = linker; 6 = TAT exon I; 7 = linker; 8 = amino acids 664-695 of 
10 APP. 

Figure 12A-B shows the DNA sequence (SEQ ID NO: 11) of the fusion 
protein APPQ-65 l)wt, TATexonI(MlL) APP (664-695). 

Figure 13 shows the amino acid sequence (SEQ ID NO: 12) of the 
fusion protein APP(l-651)wt, TATexonI(MlL) APP (664-695) with the various 
15 different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of P-secretase cleavage; 3 = wild-type K at position 612; 4 = region of y- 
secretase cleavage; 5 = linker; 6 = TAT exon I; 7 = linker; 8 = amino acids 664-695 of 
APP. 

Figure 14A-C shows the DNA sequence (SEQ ID NO: 13) of the fusion 
20 protein APP(l-65 1)S W, GAL4-VP16(delMet) APP (664-695). 

Figure 15 shows the amino acid sequence (SEQ ID NO: 14) of the 
fusion protein APP(1-651)SW, GAL4-VP16(delMet) APP (664-695) with the various 
different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of P-secretase cleavage; 3 = wild-type K at position 612; 4 = region of y- 
25 secretase cleavage; 5 = linker, 6 = GAL4-VP16; 7 = linker; 8 = amino acids 664-695 
of APP. 

Figure 16A-C shows the DNA sequence (SEQ ID NO: 15) of the fusion 
protein APP(l-651)wt, GAL4-VP16(deIMet) APP (664-695). 

Figure 17 shows the amino acid sequence (SEQ ID NO: 16) of the 
30 fusion protein APPQ-65 l)wt, GAL4-VP16(delMet) APP (664-695) with the various 
different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of P-secretase cleavage; 3 = wild-type K at position 612; 4 = region of y- 
secretase cleavage; 5 = linker; 6 = GAL4-VP16; 7 = linker; 8 = amino acids 664-695 
of APP. 
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Figure 18A-B shows the cDNA sequence (SEQ ID NO: 17) and Figure 
18C shows the amino acid sequence (SEQ ID NO: 18) of the 695 amino acid splice 
variant of wild-type Alzheimer's precursor protein (APP). See GenBank accession 
no. Y00264 and Kang et aL, 1987, Nature 325:733-736. 
5 Figure 19 shows data from an embodiment in which the assay of the 

present invention was used to identify both a P-secretase inhibitor and a y-secretase 
inhibitor. See Example 3 for details. 

Figure 20 shows a schematic diagram of pCR2.1 Gal4-VP16. 

Figure 21 A shows a schematic diagram of pRBR121. Figure 21B 
10 shows a schematic diagram of pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
TATexonI(MlL) APP (664-695). 

Figure 22A shows a schematic diagram of pRBR186. Figure 22B 
shows a schematic diagram of the viral plasmid pNL4-3. Figure 22C shows a 
schematic diagram of pcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) 
15 APP (664-695) with additional details as compared to Figure 21B, which shows the 
same plasmid. 

Figure 23 shows a schematic diagram of pRSV Kan/Neo res. 

Figure 24 shows a schematic diagram of pUCd5TAT. 

Figure 25A shows a schematic diagram of pMM321. Figure 25B-D 
20 shows the nucleotide sequence of pMM321. The upper strand is SEQ ID NO: 19. The 
lower strand (SEQ ID NO:20) is the reverse complement of SEQ ID NO: 19. 

Figure 26A shows a schematic diagram of the expression vector 
pcDNA3.1 zeo (+)APP(1-651)SW, K612V-(MlL)TATexonI. This expression vector 
directs the expression of a fusion protein containing the first 651 amino acids of APP 
25 with the Swedish version of the p-secretase cleavage site and the K612V mutation 
fused to the first exon of HIV1 TAT. The methionine at position 1 of TAT has been 
changed to leucine. Figure 26B-G shows the nucleotide sequence of pcDNA3.1 zeo 
(+)APP(1-651)SW, K612V-(MlL)TATexonI. The upper strand is SEQ ID NO:21. 
The lower strand (SEQ ID NO:22) is the reverse complement of SEQ ID NO:21. 
30 Figure 27A-B shows a schematic diagram depicting general features of 

the present invention. Figure 27 A: The vertical bar represents a fusion protein with 
APP sequences represented as unfilled or lightly shaded portions of the bar. The 
lightly shaded portion represents A|3. "B ACE" indicates the P-secretase cleavage site. 
The dark shaded portion represents the transcription factor fused between APP 
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sequences. The horizontal bar represents a membrane in which the uncleaved fusion 
protein is embedded, e.g., the endoplasmic reticulum. Figure 27B: The transcription 
factor (plus small amounts of APP), having been released from the fusion protein and 
thus the membrane by APP processing, is shown in the nucleus binding to and 
5 activating the regulatory DNA sequence ('Transcription Factor Response Element") 
that controls the expression of the reporter gene. 

Figure 28A-B shows the DNA sequence (SEQ ID NO:23) of the fusion 
protein APP(1-651)NFEV, K612V-TATexonI(MlL) APP (664-695). 

Figure 29 shows the amino acid sequence of a fusion protein (APP(1- 

10 651)NFEV, K612V-TATexonI(MlL) APP (664-695)) (SEQ ID NO:24) containing 
the sequence NFEV at the P-secretase cleavage site (underlined at 2). The other 
portions of the fusion protein are indicated as follows: 1 = amino acids 1-651 of APP; 
2 = region of P-secretase cleavage; 3 = K612V mutation; 4 = region of y-secretase 
cleavage; 5 = linker; 6 = TAT exon I; 7 = linker; 8 = amino acids 664-695 of APP. 

15 Figure 30A-C shows the DNA sequence (SEQ ID NO:25) of the fusion 

protein APP(1-651)NFEV, K612V, GAU-VP16(delMet) APP (664-695). 

Figure 31 shows the amino acid sequence of a fusion protein (APP(1- 
651)NFEV, K612V, GAL4-VP16(delMet) APP (664-695)) (SEQ ID NO:26) 
containing the sequence NFEV at the P-secietase cleavage site (underlined at 2). The 

20 other portions of the fusion protein are indicated as follows: 1 = amino acids 1-651 of 
APP; 2 = region of p-secretase cleavage; 3 = K612V mutation; 4 = region of y- 
secretase cleavage; 5 = linker; 6 = GAL4-VP16; 7 = linker; 8 = amino acids 664-695 
of APR 

Figure 32A shows a schematic diagram of pcDNA3.1 zeo (+), a 
25 eukaryotic expression vector that is suitable for use in the present invention. Figure 
32B-F shows the nucleotide sequence of pcDNA3.1 zeo (+). The upper strand is SEQ 
ID NO:27. The lower strand (SEQ ID NO:28) is the reverse complement of SEQ ED 
NO:27. 

Figure 33 shows data from an embodiment of the present invention 
30 utilizing a P-galactosidase reporter gene in which the assay of the present invention 
was used to identify both a P-secretase inhibitor and a y-secretase inhibitor. See 
Example 8 for details. 

Figure 34 shows data from an embodiment of the present invention in 
which a fusion protein having a wild-type p-secretase cleavage site and a fusion 
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protein having a Swedish P-secretase cleavage site are compared. See Example 9 for 
details. 

Figure 35A-B shows the DNA sequence (SEQ ID NO:29) of the fusion 
protein APP(1-651)NFEV, TATexonI(MlL) APP (664-695). 
5 Figure 36 shows the amino acid sequence of a fusion protein (APP(1- 

651)NEEV, TATexonI(MlL) APP (664-695)) (SEQ ID NO:30) containing the 
sequence NFEV at the (3-secretase cleavage site (underlined at 2) and a wild-type K at 
position 612 (underlined at 3). The other portions of the fusion protein are indicated 
as follows: 1 = amino acids 1-651 of APP; 2 = region of p-secretase cleavage; 3 = 
10 wild-type K; 4 = region of y-secretase cleavage; 5 = linker; 6 = TAT exon I; 7 = 
linker; 8 = amino acids 664-695 of APP. 

Figure 37A-C shows the DNA sequence (SEQ ID NO:31) of the fusion 
protein APP(1-651)NFEV, GAL4-VP16(delMet) APP (664-695). 

Figure 38 shows the amino acid sequence of a fusion protein (APP(1- 
15 651)NFEV, GAL4-VP16(delMet) APP (664-695)) (SEQ ID NO:32) containing the 
sequence NFEV at the p-secretase cleavage site (underlined at 2) and a wild-type K at 
position 612 (underlined at 3). The other portions of the fusion protein are indicated 
as follows: 1 = amino acids 1-651 of APP; 2 = region of P-secretase cleavage; 3 = 
wild-type K; 4 = region of y-secretase cleavage; 5 = linker; 6 = GAL4-VP16; 7 = 
20 linker; 8 = amino acids 664-695 of APP. 



DETAILED DESCRIPTION OF THE INVENTION 
For the purposes of this invention: 

A "fusion protein" is a protein that contains at least two polypeptide 
25 regions and, optionally, a linking peptide to operatively link the two polypeptides into 
one continuous polypeptide. The at least two polypeptide regions in a fusion protein 
are derived from different sources, and therefore a fusion protein comprises two 
polypeptide regions not normally joined together in nature. 

A "Unking sequence (or linker peptide)" contains one or more amino 
30 acid residues joined in peptide bonds. A linking sequence serves to join two 

polypeptide regions of differing origins in a fusion protein via a peptide bond between 
the linking sequence and each of the polypeptide regions. 

Typically, a fusion protein is synthesized as a continuous polypeptide 
in a recombinant host cell which contains an expression vector comprising a 
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nucleotide sequence encoding the fusion protein where the different regions of the 
fusion protein are fused in frame on either side of a linker peptide's coding sequence. 
The chimeric coding sequence (encoding the fusion protein) is operatively linked to 
expression control sequences (generally provided by the expression vector) that are 
5 functional in the recombinant host cell. 

"Reporter gene," as used in the present invention, does not mean a 
DNA sequence present on the chromosome of a cell, generally possessing introns, as 
is often meant by the word "gene" in the art. Rather "reporter gene" means any DNA 
sequence encoding a protein or polypeptide that can give rise to a signal that can be 

10 detected or measured. "Reporter gene" does not mean a portion of the amino acid 
sequence of APP. "Reporter gene" will usually mean a DNA sequence, generally a 
cDNA sequence (although in some cases a reporter gene may have introns) that 
encodes a protein or polypeptide that is commonly used in the art to provide a 
measurable phenotype that can be distinguished over background signals. 

15 A "nuclear localization signal (NLS)" is a region of a polypeptide 

which targets the polypeptide to the nucleus of the cell. One such NLS is that from 
the SV40 large T antigen. See, e.g., U.S. Patent No. 5,589,392; Kalderon et al., 1984, 
Cell 39:499-509. The minimum region of the SV40 large T antigen with NLS activity 
is Pro-Lys-Lys-Lys-Arg-Lys-Val (SEQ ID NO:22). See also U.S. Patent No. 

20 5,776,689. 

"Substances" that are screened in the present invention can be any 
substances that are generally screened in the pharmaceutical industry during the drug 
development process. For example, substances may be low molecular weight organic 
compounds (e.g., having a molecular weight of less than about 2,000 daltons and 

25 preferably less than about 1,000 daltons), RNA, DNA, antibodies, peptides, or 

proteins. Substances are often tested in the methods of the present invention as large 
collections of substances, e.g. libraries of low molecular weight organic compounds, 
peptides, or natural products. 

The conditions under which substances are employed in the methods 

30 described herein are conditions that are typically used in the art for the study of 

protein-ligand interactions or enzyme inhibition studies: e.g., salt conditions such as 
those represented by such commonly used buffers as PBS or in tissue culture media; a 
temperature of about 4°C to about 55°C; incubation times of from several seconds to 
several hours or even up to 24 or 48 hours. Screening for the identification of 
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enzyme-specific inhibitors is a well-known procedure in the pharmaceutical arts and 
the numerous conditions under which such screening has been done are available in 
the literature to guide the practitioner of he present invention. 

A "conservative amino acid substitution" refers to the replacement of 
5 one amino acid residue by another, chemically similar, amino acid residue. Examples 
of such conservative substitutions are: substitution of one hydrophobic residue 
(isoleucine, leucine, valine, or methionine) for another; substitution of one polar 
residue for another polar residue of the same charge (e.g., arginine for lysine; glutamic 
acid for aspartic acid); substitution of one aromatic amino acid (tryptophan, tyrosine, 

10 or phenylalanine) for another. 

"Transfection" refers to any of the methods known in the art for 
introducing DNA into a cell, e.g., calcium phosphate or calcium chloride mediated 
transfection, electroporation, infection with a retroviral vector. 

The present invention relates to the discovery of an assay system that 

15 permits the simultaneous screening for inhibitors of several types of amyloid 
precursor protein (APP) processing or signaling (e.g., p-secretase cleavage, y- 
secretase cleavage, APP extracellular signaling, APP cytoplasmic signaling). In a 
preferred embodiment, this screening is accomplished without the concomitant 
identification of inhibitors of oc-secretase. The assay system is carried out in a single 

20 type of cell, using a single type of assay readout. Inhibitors discovered by means of 
the present invention are expected to be useful in the treatment of Alzheimer's disease 
since these inhibitors are likely to be capable of interfering with the production of A0. 

Previous assays for identifying inhibitors of APP processing have 
focussed specifically on inhibition of either P-secretase or y-secretase activity, or on 

25 inhibition of some other single aspect of A0 production. In contrast, the assays 

described herein are directed to inhibition of APP processing in general. Substances 
identified through these assays may target P-secretase, y-secretase, modulators of p- 
secretase or y-secretase activity, or even an as-yet-undiscovered ligand interaction 
with APP. In certain embodiments, these assays will also be free of the potentially 

30 misleading or obscuring effects of oc-secretase activity. In addition, unlike other 
assays currently in use, these assays are homogeneous assays; i.e., they require no 
cumbersome or time-consuming steps such as column chromatography separations, 
immunoprecipitations, washing steps, etc. Therefore, the assays are very well adapted 
to a high throughput screening format. 



-13- 



WO 03/072041 



PCT/US03/05458 



In the present invention, novel recombinant DNA molecules are 
constructed in which nucleotide sequences encoding at least a portion of the luminal 
(i.e., N-tenninal to the transmembrane region) and transmembrane regions of APP are 
fused to nucleotide sequences encoding a transcription factor. In a preferred 
5 embodiment, the APP contains an a-secretase cleavage site that has been altered to 
reduce or eliminate a-secretase cleavage. This allows the assays of the present 
invention to avoid identifying inhibitors of osecretase and permits the more efficient 
detection of P-secretase inhibitors since a-secretase and p-secretase compete for APP 
cleavage. The recombinant DNA molecules may be transfected, along with a reporter 

10 gene, into a cell line that processes APP into A0, and stable clones may be generated. 
Alternatively, the recombinant DNA molecules and reporter plasmid may be utilized 
in transient transfections. 

Upon expression in cells, the APP/transcription factor fusion protein 
localizes to a non-nuclear membrane of the cell (e.g., the endoplasmic reticulum) due 

15 to the presence of the APP sequences in the fusion protein. In a manner similar to 
cleavage of APP, the fusion protein will then be cleaved, first by P-secretase and then 
by y-secretase. y-secretase cleavage releases the transcription factor from the 
membrane in which the APP/transcription factor fusion protein had been embedded, 
after which the transcription factor translocates to the nucleus and stimulates 

20 transcription of the reporter gene. Assuming no a-secretase cleavage, cleavage by 
both P-secretase and y-secretase is required for release of the transcription factor and 
transactivation of the reporter gene in this assay since y-secretase cleavage of APP is 
dependent on a short luminal domain, such as that generated by P- or a-secretase 
cleavage. Detection of a signal from the reporter gene product will thus serve as 

25 evidence of APP processing. In particular, since activation of the reporter gene 

requires both P-secretase and y-secretase cleavage, the assay is capable of identifying 
inhibitors of both or either of these proteases. 

Figure 27 is a schematic diagram depicting general features of the 
assay. The vertical bar in Figure 27A represents the fusion protein; the horizontal bar 

30 represents the non-nuclear membrane in which the fusion protein is embedded before 
processing. Figure 27B shows how the transcription factor portion of the fusion 
protein (with small amounts of the APP portion flanking it) has moved to the nucleus 
following release from the fusion protein by APP processing. In the nucleus, the 
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transcription factor is shown binding to a regulatory DNA sequence ('Transcription 
Factor Response Element") and activating transcription of the reporter gene. 

The recombinant DNA molecules encoding the APP/transcription 
factor fusion protein and the reporter gene can be used to develop novel homogenous 
5 cell-based assays for the identification and assessment of inhibitors of APP processing 
which will be readily amenable to high throughput technology. 

In one embodiment, the recombinant DNA molecules used in this 
invention comprise sequences encoding the amino terminal 651 amino acids of the 
695 amino acid version of APP (Kang et al., 1987, Nature 325:733-736), including all 

10 the sequences necessary for the production of AP, as well as the C-terminal 32 amino 
acids of APP. The transcription factor is placed between the N-terminal and C- 
terminal portions of APP. The APP sequence may include a modification to increase 
the amount of P-secretase cleavage of the fusion protein. This modification involves 
mutating the K at position 612 of the a-secretase cleavage site to a V (K612V). Since 

15 a-secretase and P-secretase compete for APP cleavage, reducing or eliminating APP 
cleavage by a-secretase results in increased P-secretase cleavage, and allows the assay 
to detect P-secretase inhibitors more readily. In addition, the P-secretase cleavage site 
within APP (KMiDA) (SEQ ID NO:34) may be modified, e.g., to that of a naturally 
occurring mutation (termed the "Swedish" mutation or NLlDA) (SEQ ID NO:38) 

20 which has been shown to enhance P-secretase cleavage six-fold in cultured cells. 

Another possible modification is to replace the (KMiDA) (SEQ ID NO:34) wild-type 
P-secretase cleavage site with the sequence (NFiEV) (SEQ ID NO:40). The presence 
of NEEV in an amino acid sequence has been shown to enhance P-secretase cleavage 
by an even larger amount than the Swedish sequence. See U.S. Provisional Patent 

25 Application Serial No. 60/292,591 and U.S. Provisional Patent Application Serial No. 
60/316,115, the disclosures of which are incorporated herein, in their entirety. 

In a preferred embodiment, HIV-1 TAT exon I has been fused between 
sequences encoding the first 651 amino acids of APP695 and the last 32 amino acids 
of APP695 (APP-TAT-APPct32). Co-transfection of an expression vector comprising 

30 this construct with a reporter gene plasmid containing an HTV-1 LTR promoter that 
controls the transcription of a reporter gene leads to enhanced expression of the 
reporter gene. Other transcription factors that could be fused to APPi-651 include 
Gal4-VP16, the entire Gal4 protein, BIV TAT, HTV-2 TAT, SIV TAT, LexA-VP16, 
EBV Zta, Papillomavirus E2, or tissue or species specific homodimeric bHLH 
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transcription factors capable of activating transcription through specific DNA 
response elements, such as E12, E47, or Twist. The use of GAL4, BIV, HEV-2, or 
SIV TAT may be useful if it is desired to reduce the potency of the transactivator, thus 
reducing any background transactivation caused by non-specific cleavage of the fusion 
5 protein. To further reduce the potential for transactivation by TAT in the absence of 
p-secretase and y-secretase cleavage, the TAT portion of the fusion protein may be 
altered to remove the N-terminal methionine and thus eliminate the possibility of 
aberrant translation of TAT through any potential internal ribosomal entry sites. 

In some circumstances, high level expression of TAT has been found 

10 to be toxic to cells. Thus, when TAT is the transcription factor fused to APP in the 
methods of the present invention, it may be advantageous to utilize transient 
transfection with low amounts of the expression vector encoding the APP/TAT fusion 
protein. A set of preliminary experiments in which various amounts of the vector are 
transfected, in order to titrate acceptable levels of TAT, is recommended. 

15 The reporter gene used will depend in large part upon the transcription 

factor fused to APP. The promoter used to drive the reporter gene will be LTR for 
TAT-based APP fusion proteins, or UAS (6x) for GAL4-VP16-based APP fusion 
proteins. In a particular embodiment, an LTR driving EGFP (enhanced green 
fluorescent protein, a brighter variant of GEP made by Aurora Biosciences, San 

20 Diego, CA) has been used to observe processing of an APP/TAT fusion protein. 
Under certain conditions, it may be desirable to use a less stable reporter, such as 
dsEGFP (a destabilized variant of EGFP made by Aurora Biosciences, San Diego, CA 
and marketed by Clontech, Palo Alto, CA) or a more potent reporter, such as |3- 
lactamase. Alternatively, a stable HeLa cell line expressing LTR-|3-galactosidase can 

25 be used. If the exquisite sensitivity of (3-lactamase makes it less than optimal for a 
particular purpose, the LTR-fJ-galactosidase cell line may be exploited for this assay. 
Finally, under some circumstances Gal4-VP16 may prove to be optimal relative to 
TAT to reduce any inherent background problems associated with using the weakly 
but constitutively active LTR in the reporter plasmid, in which case the reporter 

30 plasmid could be UAS(6x)-(3-lactamase (Aurora Biosciences, San Diego, CA). 

A variety of cells are suitable for use in the methods of the present 
invention. Particularly preferred are eukaryotic, especially mammalian, cell lines. In 
particular embodiments, the cells are selected from the group consisting of: L cells L- 
M(TK") (ATCC CCL 1.3), L ceUs L-M (ATCC CCL 1.2), HEK293 (ATCC CRL 
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1573), HEK293T, Raji (ATCC CCL 86), CV-l (ATCC CCL 70), COS-1 (ATCC CRL 
1650), COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), 
NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS- 
C-l (ATCC CCL 26), T24 (ATCC HTB-4), PC12 cells, Jurkat cells, H4 cells (ATCC 
5 HTB-148), and MRC-5 (ATCC CCL 171). 

To make the assay more amenable for ultra-high throughput screening, 
a non-adherent cell line, such as Jurkat, can be used. 

Generally, the assays of the present invention employ cells that 
naturally express |3-secretase and y-secretase. However, it is possible to practice the 

10 invention in cells that lack the expression of one, or both, of these enzymes. In such 
cases, P-secretase and y-secretase activity can be provided by the recombinant 
expression of these enzymes in the cells. 

In one embodiment, the present invention provides a recombinant cell, 
preferably a eukaryotic cell, even more preferably a mammalian cell, and most 

15 preferably a human cell, where the cell expresses a fusion protein of APP and a 

transcription factor and the cell contains a reporter gene that can be activated by the 
transcription factor. The fusion protein comprises a portion of APP where that portion 
includes the regions of the P-secretase and y-secretase cleavage sites fused to a 
transcription factor. The region of APP including the P-secretase and y-secretase 

20 cleavage sites can be, e.g., a portion of APP that includes amino acids 589-651 of the 
695 amino acid version of APP. This region is shown below. 

| 11 
EEISEVKM DAEFRJEIDSGYEVHHQK^ LA 
25 TVIVITLVMLKKK (SEQ ID NO:33) 

i 

The P-secretase cleavage site is shown at position 596-597 (KM DA) (SEQ ID 
NO:34). 

Two predominant cleavage sites of y-secretase are shown at positions 636-637 and 
30 638-639 

i i 

(GW IA TV)(SEQIDNO>35). 

The fusion protein will be anchored in the membrane by the APP 
35 sequences shown above. The N-terminal portion of APP must include at least the 0- 
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secretase cleavage site, and possibly several amino-acids N-terminal to the P-secretase 
cleavage site to make the assay sensitive to both P-secretase and y-secretase inhibitors. 
In many cases, the APP sequences will include sequences further N-terminal than 
those shown above, including the signal sequence at the N-terminus of APP. In cases, 

5 where the APP signal sequence is not used, another signal sequence may be 

incorporated in the fusion protein. Such other signal sequences are known in the art. 

In a related embodiment, the present invention provides a recombinant 
cell, preferably a eukaryotic cell, even more preferably a mammalian cell, and most 
preferably a human cell, where the above-described APP portion of the fusion protein 

10 contains a K612V mutation. The APP portion of this embodiment is shown below. 

i I I 

EECSEVKM DAEFRHDSGYEVIfflQY^ IA 
TVIVTILVMLKKK (SEQ ID NO:36) 

15 I 

The p-secretase cleavage site is shown at position 596-597 (KM DA) (SEQ ID 
NO:34). 

Two predominant cleavage sites of y-secretase are shown at positions 636-637 and 
638-639 

20 11 

(GW IA TV) (SEQ ID NO:35). 
The underlined V at position 612 shows the change in sequence in the present 
invention from the wild-type K to the mutant V, which change provides for reduced 
cleavage by a-secretase. 

25 In a related embodiment, the present invention provides a recombinant 

cell, preferably a eukaryotic cell, even more preferably a mammalian cell, and most 
preferably a human cell, where the above-described APP portion of the fusion protein 
contains the Swedish version of the P-secretase cleavage site as well as a K612V 
mutation. The APP portion of this embodiment is shown below. 

30 

i 1 i 

EEISEVNL DAEFRHDSGYEVHHQVLV^ IA 
TVTVTTLVMLKKK (SEQ ID NO:37) 

I 

35 The p-secretase cleavage site is shown at position 596-597 (NL DA) (SEQ ID 
NO:38). 
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Two predominant cleavage sites of y-secretase are shown at positions 636-637 and 
638-639 

I I 

(GW IA TV)(SEQIDNO:35). 
5 The underlined V at position 612 shows the change in sequence in the present 

invention from the wild-type K to the mutant V, which change provides for reduced 
cleavage by oc-secretase. 

In a related embodiment, the present invention provides a recombinant 
cell, preferably a eukaryotic cell, even more preferably a mammalian cell, and most 
10 preferably a human cell, where the above-described APP portion of the fusion protein 
contains the NFEV version of the p-secretase cleavage site as well as a K6 12V 
mutation. The APP portion of this embodiment is shown below. 

15 EEISEVNF EVEFRHDSGYEVHHQVLVFFAEDVGSNK IA 
TVIWLVMLKKK (SEQ ID NO:39) 

I 

The P-secretase cleavage site is shown at position 596-597 (NF EV) (SEQ ID 
NO:40). 

20 Two predominant cleavage sites of y-secretase are shown at positions 636-637 and 
638-639 

I i 

(GW IA TV)(SEQIDNO:35). 
The underlined V at position 612 shows the change in sequence in the present 

25 invention from the wild-type K to the mutant V, which change provides for reduced 
cleavage by oc-secretase. 

The presence of both P-secretase and y-secretase cleavage sites in the 
fusion proteins permits the assays of the present invention to detect inhibitors of both 
P-secretase and y-secretase. 

30 The recombinant host cells of the present invention can be further 

engineered to comprise a reporter gene construct. The reporter gene construct 
contains a reporter gene in operable linkage with a regulatory DNA sequence that 
confers on the reporter gene the property of being regulated by the transcription factor 
of the fusion protein. This regulation is such that expression of the reporter gene is 

35 low or absent without binding of the transcription factor to the regulatory DNA 
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sequence but, when the transcription factor is released from the fusion protein by APP 
processing, the transcription factor can move into the nucleus of the cell and bind to 
the regulatory DNA sequence, thereby activating transcription from the reporter gene. 
Reporter genes desirably give rise to gene products which can be 
5 detected or quantitated, either in terms of amount of protein synthesized, enzymatic 
activity, fluorescence, luminescence, or some other phenotype. Suitable reporter gene 
products include firefly luciferase (de Wet et al., 1987, MoL Cell. Biol. 7:725-737) or 
bacterial luciferase (Englebrecht et al., 1985, Science 227:1345-1347; Baldwin et al., 
1984, Biochem. 23:3663-3667), ^-lactamase, ^-glucuronidase, p-galactosidase, green 

10 fluorescent proteins, enhanced green fluorescent protein, destabilized enhanced green 
fluorescent protein, red fluorescent protein, yellow fluorescent protein, cyan 
fluorescent protein, destabilized yellow fluorescent protein, destabilized cyan 
fluorescent protein, aequorin, chloramphenicol acetyl transferase (Alton & Vapnek, 
1979, Nature 282:864-869), rat liver alkaline phosphatase (Toh et al., 1989, Eur. J. 

15 Biochem. 182:231-237), human placental secreted alkaline phosphatase (Cullen & 
Mallim, 1992, Meth. Enzymol. 216:362-368), and horseradish peroxidase, among 
others. 

A preferred reporter gene is green fluorescent protein (GFP) or a 
modified GFP. Wild-type GFP has long been used in the art. Starting from green 

20 fluorescent protein, many modified versions have been derived with altered or 

enhanced spectral properties as compared with wild-type GFP. See, e.g., U.S. Patent 
No. 5,625,048; International Patent Publication WO 97/28261; International Patent 
Publication WO 96/23810. Useful are the modified GFPs W1B and TOPAZ, 
available commercially from Aurora Biosciences Corp., San Diego, CA. W1B 

25 contains the following changes from the wild-type GFP sequence: F64L, S65T, 

Y66W, N146I, M153T, and V163A (see Table 1, page 519, of Tsien, 1998, Ann. Rev. 
Biochem. 67:509-544). TOPAZ contains the following changes from the wild-type 
GFP sequence: S65G, V68L, S72A, and T203Y (see Table 1, page 519, of Tsien, 
1998, Ann. Rev. Biochem. 67:509-544). Wild-type nucleotide and amino acid 

30 sequences of GFP are shown in Figure 1 and SEQ ID NO: 1 of International Patent 
Publication WO 97/28261; in Figure 1 of Tsien, 1998, Ann. Rev. Biochem. 67:509- 
544; and in Prasher et al., 1992, Gene 111:229-233. 

When expressing GFPs in mammalian cells, it may be advantageous to 
construct versions of the GFPs having altered codons that conform to those codons 
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preferred by mammalian cells (Zolotukhin et al., J. Virol. 1996, 70:4646-46754; Yang 
et al., 1996, Nucl. Acids Res. 24:4592-4593). Another way of improving GFP 
expression in mammalian cells is to provide an optimal ribosome binding site by the 
use of an additional codon immediately after the starting methionine (Crameri et al., 

5 1996, Nature Biotechnology 14:315-319). 

Transcription factors that are useful in the present invention are 
preferably those transcription factors that are not naturally expressed in the 
recombinant host cells. This is so the regulatory DNA sequence is not activated 
absent APP processing and release of the transcription factor from the fusion protein. 

10 Preferably, the transcription factor contains, or is engineered to contain, a nuclear 
localization signal. This is so that, after release, the transcription factor will move 
into the nucleus of the genetically modified host cells where it can bind to, and 
activate, the regulatory DNA sequence, leading to expression of the reporter gene. 
Transcription factors, as used in the present invention, do not include proteins that, 

15 after release from a fusion protein and translocation into the nucleus, repress 
transcription from a reporter gene. 

Among the transcription factors that are useful in the present invention 
are: HIV1 TAT (in particular exon I of HIV1 TAT), Gal4-VP16, the entire Gal4 
protein, BIV TAT, HIV-2 TAT, SIV TAT, LexA-VP16, EBV Zta, Papillomavirus E2, 

20 or one of the bHLH homodimeric transcription factors, E12, E47, or Twist. 

Expression vectors are generally used to express the fusion protein in 
the recombinant cells. An expression vector contains recombinant nucleic acid 
encoding a polypeptide (e.g., an APP/transcription factor fusion protein) along with 
regulatory elements for proper transcription and processing. Generally, the regulatory 

25 elements that are present in an expression vector include a transcriptional promoter, a 
ribosome binding site, a transcriptional terminator, and a polyadenylation signal. 
Other elements may include an origin of replication for autonomous replication in a 
host cell, a selectable marker, a limited number of useful restriction enzyme sites, and 
a potential for high copy number. 

30 A variety of expression vectors are known in the art and can be used in 

the present invention. Commercially available expression vectors which are suitable 
include, but are not limited to, pMClneo (Stratagene), pSG5 (Stratagene), pcDNAI 
and pcDNAIamp, pcDNA3, pcDNA3.1, pCR3.1 (Invitrogen, San Diego, CA), EBO- 
pSV2-neo (ATCC 37593), pBPV-l(8-2) (ATCC 37110), pdBPV-MMTneo(342-12) 
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(ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), pCLneo 
(Promega), pTRE (Clontech, Palo Alto, CA), pVUneo, plRESneo (Clontech, Palo 
Alto, CA), pCEP4 (Invitrogen, San Diego, CA), pSCll, and pSV2-dhfr (ATCC 
37146). The choice of vector will depend upon the cell type in which it is desired to 

5 express the APP/transcription factor fusion protein, as well as on the level of 
expression desired, and the like. 

The expression vectors can be used to transiently express or stably 
express the fusion protein. The transient expression or stable expression of 
transfected DNA is well known in the art. See, e.g., Ausubel et al., 1995, 

10 "Introduction of DNA into mammalian cells," in Current Protocols in Molecular 
Biology, sections 9.5.1-9.5.6 (John Wiley & Sons, Inc.). 

The recombinant host cells of the present invention are useful in 
methods of screening substances for the ability to inhibit APP processing. In one 
embodiment, the methods of the present invention comprise adding a candidate 

15 substance to a recombinant host cell comprising an APP/transcription factor fusion 
protein and a reporter gene and comparing the level of expression of the reporter gene 
protein in the presence and absence of the candidate substance, wherein the level of 
expression of the reporter gene protein is lower when the candidate substance inhibits 
processing of the APP/transcription factor fusion protein such that the transcription 

20 factor is not released, or is released in a lower amount, than in the absence of the 
substance. 

The level of expression of the reporter gene protein is generally not 
measured directly. Rather, an indirect method is used. For example, fluorescence 
given off by the reporter gene protein may be detected or measured as, e.g., when the 

25 reporter gene product is a green fluorescent protein; or, some enzymatic activity of the 
reporter gene product may be detected or measured, e.g., when the reporter gene 
product is P-lactamase. 

The candidate substance may be of any form suitable for entry into the 
cytoplasm of the recombinant cell or for contact with the cell's cytoplasmic 

30 membrane. Under appropriate conditions, the candidate substance may be allowed to 
freely diffuse into the cell, or the delivery of the substance may be facilitated by 
techniques and substances which enhance cell permeability, a wide variety of which 
are known in the art. Methods for increasing cell permeability include, without 
limitation, the use of organic solvents such as dimethylsulfoxide, liposomes, 
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application of electrical current, and physical means such as substance-coated teflon 
pellets. 

The present invention provides a method of identifying a substance that 
inhibits APP processing comprising: 
5 (a) providing a recombinant eukaryotic cell which: 

(i) expresses a fusion protein comprising amino acids 589- 
651 of APP695 and a transcription factor where the transcription factor is fused in 
frame to the carboxyl terminus of amino acids 589-651 of APP695; and 

(ii) comprises a reporter gene operably linked to a regulatory 
10 DNA sequence which is capable of being activated by the transcription factor; 

(b) measuring the level of reporter gene product in the cell in the 
absence of the substance; 

(c) adding the substance to the cell and measuring the level of 
reporter gene product in the cell in the presence of the substance; 

15 where a decrease in the level of reporter gene product in the presence 

as compared to the absence of the substance indicates that the substance inhibits APP 
processing. 

The manner in which the level of the reporter gene product is measured 
will be determined by the nature of the reporter gene and, often, the characteristics of 

20 the host cell. For example, if the reporter gene product itself is fluorescent, as for 
example, when a green fluorescent protein is the reporter gene product, fluorescence 
from the cell can be measured directly. When the reporter gene product has enzymatic 
activity, for example, when the reporter gene product is (3-lactamase, known methods 
of measuring that enzymatic activity can be used. 

25 For the sake of clarity, the above method is described in terms of "a" 

cell. In actual practice, the method will generally be carried on a large number of cells 
at one time. For example, the method will often be carried out in a well of a tissue 
culture plate, where, depending on the number of wells in the plate (and thus their 
size), there can be up to hundreds, thousands, or even several million cells. The step 

30 of "adding the substance to the cell" is generally carried out by simply adding the 
substance to the tissue culture medium in which the cells are present. After the 
substance is added to the cell, the cell and the substance are incubated for a period of 
time sufficient for the substance to inhibit APP processing, if the substance is actually 
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an inhibitor of APP processing. This period is usually from about 15 minutes to 48 
hours, but may be somewhat more in unusual cases. 

A convenient way of carrying out the method is to grow a population 
of the recombinant eukaryotic cells and then split the population into a portion that 
5 will be exposed to the substance and a portion that will not be exposed to the 
substance. 

The recombinant eukaryotic cell is generally produced by transfection 
of an expression vector encoding the fusion protein and by transfection of a plasmid 
containing the reporter gene. 

10 One skilled in the art would recognize that what is sought in terms of 

"a decrease in the level of reporter gene product in the presence as compared to the 
absence of the substance" is a non-trivial decrease. For example, if in the method 
described above there is found a 1% decrease, this would not indicate that the 
substance is an inhibitor of APP processing. Rather, one skilled in the art would 

15 attribute such a small decrease to normal experimental variance. What is looked for is 
a significant decrease. For the purposes of this invention, a significant decrease 
fulfills the usual requirements for a statistically valid measurement of a biological 
signal. For example, depending upon the details of the embodiment of the invention, 
a significant decrease might be a decrease of at least 10%, preferably at least 20%, 

20 more preferably at least 50%, and most preferably at least 90%. 

In particular embodiments, amino acids 589-651 of APP695 contain a 

K612V mutation. 

In particular embodiments, the cell is a mammalian cell. In particular 
embodiments, the cell is a human cell. 
25 In particular embodiments, the method is used to screen a library of 

more than 1,000 substances. In other embodiments, the method is used to screen a 
library of more than 50,000 substances at a rate of more than 1,000 substances per 24 
hours. 

In particular embodiments, the fusion protein comprises a portion of 
30 APP that is selected from the group consisting of: amino acids 1-651 of APP695, 
amino acids 50-651 of APP695, amino acids 100-651 of APP695. amino acids 150- 
651 of APP695, amino acids 200-651 of APP695, amino acids 250-651 of APP695, 
amino acids 300-651 of APP695, amino acids 350-651 of APP695, amino acids 400- 
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651 of APP695, amino acids 450-651 of APP695, amino acids 500-651 of APP695, 
and amino acids 550-651 of APP695- 

In related embodiments, the fusion protein does not comprise all of 
amino acids 589-651 of APP695- Rather, the fusion protein comprises slightly fewer 
5 amino acids from APP. For example, the fusion protein might comprise slightly fewer 
amino acids of the (3-secretase cleavage site: e.g., amino acids 590-651 of APP695- 
Or the fusion protein might comprise slightly fewer amino acids of the y-secretase 
cleavage site: amino acids 589-650 of APP695; amino acids 589-649 of APP695; 
amino acids 589-648 of APP695; or amino acids 589-647. The fusion protein may 

10 even comprise slightly fewer amino acids from both ends, e.g., amino acids 590-647 
of APP695- What is important is that the portion of APP included in the fusion 
protein contains both the (3-secretase cleavage site and the y-secretase cleavage site. 

In particular embodiments, the transcription factor is selected from the 
group consisting of: HIV-1 TAT, Gal4-VP16, the entire Gal4 protein, LexA-VP16, 

15 EB V Zta, Papillomavirus E2, one of the bHLH homodimeric transcription factors, 
including E12, E47, or Twist, or BIV TAT, HTV-2 TAT, or SIV TAT. A particular 
version of HTV-1 TAT suitable for use in the present invention is HIV-1 TAT exon I. 

Fusion proteins suitable for use in the present invention can be selected 
from the group consisting of: APP(1-651)SW, K612V-TATexonI(MlL) APP (664- 

20 695) (SEQ ID NO:2); APP(l-651)wt, K612V-TATexonI(MlL) APP (664-695) (SEQ 
ID NO:4); APP(1-651)SW, K612V, Gal4-VP16(M1L) APP (664-695) (SEQ ID 
NO:6); APP(l-651)wt, K612V, Gal4-VP16(M1L) APP (664-695) (SEQ ID NO:8); 
APP(1-651)SW, TATexonI(MlL) APP (664-695) (SEQ ID NO: 10); APP(l-651)wt, 
TATexonI(MlL) APP (664-695) (SEQ ID NO: 12); APP(1-651)SW, Gal4- 

25 VP16(M1L) APP (664-695) (SEQ ID NO:14); APP(l-651)wt, Gal4-YP16(M1L) APP 
(664-695) (SEQ ID NO:16); APP(1-651)NFEV, K612V-TATexonI(MlL) APP (664- 
695) (SEQ ID NO:23); and APP(1-651)NFEV, K612V, GADt-VP16(MlL) APP 
(664-695) (SEQIDNO:25). 

In some embodiments of the present invention, the amino acid 

30 sequences contributed to the fusion protein by the transcription factor constitute the 
carboxy terminal amino acid sequences of the fusion protein. In other embodiments, 
the transcription factor has other sequences fused to its carboxy terminus, as in the 
examples herein where amino acids 664-695 of APP695 are fused to the carboxy 

terminus of the transcription factor and therefore constitute the carboxy terminal 
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amino acid sequences of the fusion protein. Other portions of APP (e.g., amino acids 
652-695 of APP695) could be used instead of amino acids 664-695 of APP695- In 
fact, it should be possible to extend the carboxy terminus of the transcription factor 
with almost any amino acid sequences, providing such sequences do not interfere with 

5 the ability of the transcription factor to move into the nucleus and activate 

transcription of the reporter gene once the transcription factor has been released from 
the fusion protein by the action of y-secretase. 

The present invention includes a method of identifying a substance that 
inhibits APP processing comprising: 

10 (a) providing a recombinant eukaryotic cell which: 

(i) expresses a fusion protein comprising an amino acid 
sequence from APP that is capable of being cleaved by both 0-secretase and y- 
secretase and a transcription factor where the transcription factor is fused in frame to 
the amino acid sequence from APP; and 

15 (ii) comprises a reporter gene operably linked to a 

regulatory DNA sequence which capable of being activated by the transcription factor; 

(b) measuring the level of reporter gene product in the cell in the 
absence of the substance; 

(c) adding the compound to the cell and measuring the level of 
20 reporter gene product in the cell in the presence of the substance; 

where a decrease in the level of reporter gene product in the presence 
as compared to the absence of the substance indicates that the substance inhibits APP 
processing. 

In particular embodiments, the amino acid sequence from APP 

25 comprises: 

589- 651 of APP695 

590- 651 of APP695 

589- 650 of APP695 

590- 650 of APP695 
30 589-649 of APP695 

590-649 of APP695 

589- 648 of APP695 

590- 648 of APP695 
589-647 of APP695; or 
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590-647 of APP695. 

In related embodiments, the amino acid sequence from APP contains 
the amino acid sequence NLDA (SEQ ID NO:38) at the P-secretase cleavage site 
instead of the wild-type sequence KMDA (SEQ ID NO:34). 
5 In related embodiments, the amino acid sequence from APP contains 

the amino acid sequence NFEV (SEQ ID NO:40) at the p-secretase cleavage site 
instead of the wild-type sequence KMDA (SEQ ID NO:34). 

The portion of the fusion protein that is derived from APP may contain 
mutations that are known in the art. Of particular interest are mutations that result in 
10 an increased proportion of AP being made in the form of Api-42 rather than Api-40. 
Such mutations are disclosed in the following publications (numbering is from the 
770 amino acid version of APP): 

Swedish (K670N, M671L): Mullan et al., 1992, Nature Genet. 1:345-347. 

Flemish (A692G): Hendriks et al., 1992, Nature Genet. 1:218-221; Cras et al., 1998, 
15 Acta Neuropathol. (Berlin) 96:253-260. 

Dutch (E693Q): Levy et al., 1990, Science 248:1124-1126. 

Arctic (E693G): Nilsberth et al., 2001, Nature Neuroscience 4: 887-893. 

Austrian (T714I): Kumar-Singh et al., 2000, Hum. Mol. Genet. 9:2589-2598. 

French (V715M): Ancolio et al., 1999, Proc. Natl. Acad. Sci. (USA) 96:4119-4124. 
20 Florida (I716V): Eckman et al., 1997, Hum. Mol. Genet. 6:2087-2089. 

V717F: Murrell et al., 1991, Science 254:97-99. 

V717G: Chartier-Harlin et al., 1991, Nature 353:844-846. 

London (V717I): Goate et al., 1991, Nature 349:704-706. 

L723P: Kwok et al., 2000, Ann. Neurol. 47:249-253. 
25 I716F (also called I45F, referring to the position relative to the P-secretase cleavage 

site): This mutation in APP changes processing of Ap almost exclusively to Api-42. 

Iichtenthaler et al., 1999, Proc. Natl. Acad. Sci. (USA) 96:3053-3058. 

As with many proteins, it may be possible to modify many of the 

amino acids of the fusion proteins described above and still retain substantially the 
30 same biological activity in terms of APP processing as for the original fusion protein. 

Thus, the present invention includes modified fusion proteins which have amino acid 

deletions, additions, or substitutions but that still retain substantially the same 

properties with respect to APP processing as the fusion proteins described herein. It is 

generally accepted that single amino acid substitutions do not usually alter the 
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biological activity of a protein (see, e.g., Molecular Biology of the Gene , Watson et 
al.> 1987, Fourth Ed., The Benjanrin/Cummings Publishing Co., Inc., page 226; and 
Cunningham & Wells, 1989, Science 244: 1081-1085). Accordingly, the present 
invention includes fusion proteins where one amino acid substitution has been made 

5 in the fusion proteins described herein where the fusion proteins still retain 

substantially the same properties with respect to APP processing as the fusion proteins 
described herein. The present invention also includes fusion proteins where two or 
more amino acid substitutions have been made in the fusion proteins described herein 
where the fusion proteins still retain substantially the same properties with respect to 

10 APP processing as the fusion proteins described herein. In particular, the present 
invention includes embodiments where the substitutions are conservative 
substitutions. 

With the exception of Figure 18, the nucleotide and amino acid 
sequences of APP disclosed herein contain a minor difference compared to APP 

15 sequences that are usually reported in the literature. For the sequences disclosed 

herein with such a difference, the nucleotide at position 367 is an A rather than a G, as 
in most published APP sequences. This change results in a conservative substitution 
in the corresponding APP amino acid sequence. Thus, the amino acid sequences 
disclosed herein with such a difference have an I rather than a V at position 123. This 

20 difference does not affect the properties of the fusion proteins for the purposes of the 
present invention. Therefore, fusion proteins having the APP sequence reported in the 
literature with an G at nucleotide position 367 and a V at amino acid position 123 and 
the fusion proteins disclosed herein with an A at nucleotide position 367 and an I at 
amino acid position 123 are to be considered equivalents for the purposes of the 

25 present invention. 

The Gal-VP16 sequences disclosed herein contain two changes from 
the usual published sequences. There is T to C change at nucleotide position 2131 
that causes a S to P change at amino acid position 712; there is A to C change at 
nucleotide position 2301 that does not change the amino acid sequence. It is expected 

30 that Gal-VP16 proteins containing the usual sequences reported in the literature will 
also be suitable for use in the present invention. 

The methods of the present invention can be used to screen libraries of 
substances or other sources of substances to identify substances that are inhibitors of 
P-secretase or Y-secretase. Such identified inhibitory substances can serve as "leads" 
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for the development of pharmaceuticals that can be used to treat patients having 
Alzheimer's disease or in a prophylactic manner to prevent or delay the development 
of Alzheimer's disease. Such leads can be further developed into pharmaceuticals by, 
for example, subjecting the leads to sequential modifications, molecular modeling, 
5 and other routine procedures employed in the pharmaceutical industry. The inhibitors 
of APP processing identified by the present invention may also be tested in animal 
models of Alzheimer's disease such as the various transgenic mouse models that are 
known in the art. 

Although a wide variety of substances can be screened by the methods 

10 of the present invention, preferred substances for screening are libraries of small 
molecule compounds. Small molecule compounds are preferred because they are 
more readily absorbed after oral administration, have fewer potential antigenic 
determinants, and are more likely to cross the blood/brain barrier than larger 
molecules such as nucleic acids or proteins. 

15 Once identified by the methods of the present invention, the candidate 

small molecule compounds may then be produced in quantities sufficient for 
pharmaceutical testing and formulated in a pharmaceutically acceptable carrier (see, 
e.g., Remingto n's Pharmaceutical Sciences , Gennaro, A., ed., Mack Publishing, 1990, 
for suitable methods). The candidate compounds may be administered to cell lines 

20 relevant to Alzheimer's disease, animal models of Alzheimer's disease, or 
Alzheimer's disease patients. 

The numbering of the amino acids in APP used herein is based on the 
695 amino acid version of APP described in Kang et al., 1987, Nature 325:733-736. 
There are two other major versions of APP, having 751 amino acids and 770 amino 

25 acids (see, Ponte et al., 1988, Nature 331:525-527 for the 751 amino acid version and 
Kitaguchi et al., 1988, Nature 331:530-532 for the 770 amino acid version). One 
skilled in the art will understand how to translate the numbering used herein, based on 
the 695 amino acid version of APP, into the corresponding numbering for other 
versions of APP. For example, some of the APP/transcription factor fusion proteins 

30 of the present invention contain the K612V mutation, based on the numbering of the 
695 amino acid version. This mutation would correspond to a K668V mutation in the 
751 amino acid version and a K687V mutation in the 770 amino acid version. 
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Therefore, when a "K612V" mutation is referred to herein, it will be 
understood that such reference also includes a K668V mutation of the 751 amino acid 
version of APP as well as a K687V mutation of the 770 amino acid version of APP. 

Similarly, the portion of APP referred to as APPi-651 herein, based on 
5 the 695 amino acid version, will be understood to mean also APPl-707 of the 751 
amino acid version and APP 1-726 of the 770 amino acid version. 

If desired, inhibitors that are identified by the methods of the present 
invention can be further tested to determine which step in APP processing they affect. 
Assays that are known to be specific for the various steps of APP processing can be 

10 used for this purpose. For example, the assay of Karlstrom et al., (Journal of 

Biological Chemistry papers in press, published on December 13, 2001 as Manuscript 
C 100649200) is only capable of detecting inhibitors of y-secretase and cannot also 
detect inhibitors of other steps of APP processing such as, e.g., inhibitors of P- 
secretase. If an inhibitor identified by the methods of the present invention is found to 

15 also be an inhibitor when tested in the assay of Karlstrom et al., then that inhibitor is 
at least a y-secretase inhibitor. It is still possible that that inhibitor could inhibit other 
steps in APP processing as well. Further tests known in the art can determine this. 

The present invention may be modified so as to provide methods of 
determining at which step of APP processing a known inhibitor of APP processing 

20 exerts its effect. The known inhibitor may be one that has been identified by the 
methods of the present invention or by some other method. The modification to the 
present invention consists in mutating the p-secretase site in a fusion protein so that P- 
secretase cleavage can no longer occur at the site or occurs at a very much reduced 
level. Providing that the fusion protein contains a cleavable oc-secretase site, the 

25 fusion protein can still be used in the methods of the present invention. However, this 
fusion protein (with a mutated P-secretase site) can no longer detect P-secretase 
inhibitors. Therefore, if the known APP processing inhibitor still functions as an APP 
processing inhibitor in this modified version of the invention, then the known 
inhibitor cannot be a P-secretase site inhibitor but instead must exert its effect 

30 downstream of P-secretase. 

Suitable mutations of the P-secretase site include the following. All 
the sequences are for amino acid positions 594-598 of APP695- 

VNFAV (SEQ ID NO:41): This mutation shows decreased P-secretase cleavage 
relative to the wild type, KMDA (SEQ ID NO:34), sequence. 
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VKVDA (SEQ ID NO:42): Vassar et al., 1999, Science 286:735-741. This mutant 
was tested in vitro only, but purified P-secretase failed to cleave a 30-amino acid 
peptide containing this sequence. 

WKMDA (SEQ ID NO:43), VKADA (SEQ ID NO:44), VKKDA (SEQ ID NO:45), 

5 VKEDA (SEQ ID NO:46), VKEDA (SEQ ID NO:47), VKMIA (SEQ ID NO:48), 
VKMNA (SEQ ID NO:49), VKMEA (SEQ ID NO:50), VKMDE (SEQ ID NO:51), 
VKMDK (SEQ ID NO:52): Citron et al., 1995. Neuron 14:661-670. These mutations 
decreased Ap production 4-20X relative to p3 production in cultured cells. 

Fusion proteins can be constructed by use of the polymerase chain 

10 reaction (PCR) to amplify desired portions of APP and transcription factors, which 
can be then be cloned into expression vectors by methods well known in the art. 
Primers for PCR will generally include a small part of the APP or transcription factor 
as well as convenient cloning sites and/or linker peptide sequences. The PCR primers 
can be used to amplify the desired APP or transcription factor fragments from sources 

15 such as previously cloned APP or transcription factors, cDNA libraries, or genomic 
libraries. The amplified APP and transcription factor sequences can be cloned into 
suitable expression vectors. Methods of PCR and cloning are well known in the art 
and can be found in standard reference materials such as those listed below. 

Standard techniques for cloning, DNA isolation, amplification and 

20 purification, for enzymatic reactions involving DNA ligase, DNA polymerase, 

restriction endonucleases and the like, and various separation techniques are known 
and commonly employed by those skilled in the art. A number of standard techniques 
are described in Sambrook et al. (1989) Molecular Cloning, Second Edition, Cold 
Spring Harbor Laboratory, Plainview, N.Y.; Maniatis et al. (1982) Molecular Cloning, 

25 Cold Spring Harbor Laboratory, Plainview, N. Y.; Wu (ed.) (1993) Meth. Enzymol. 
218, Part I; Wu (ed.) (1979) Meth. Enzymol. 68; Wu et al. (eds.) (1983) Meth. 
Enzymol. 100 and 101; Grossman and Moldave (eds.) Meth. Enzymol. 65; Miller 
(ed.) (1972) Experiments in Molecular Genetics, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, N.Y.; Old and Primrose (1981) Principles of Gene Manipulation, 

30 University of California Press, Berkeley; Schleif and Wensink (1982) Practical 

Methods in Molecular Biology; Glover (ed.) (1985) DNA Cloning Vol. I and n, IRL 
Press, Oxford, UK; Hames and Higgins (eds.) (1985) Nucleic Acid Hybridization, IRL 
Press, Oxford, UK; Sedow and Hollaender (1979) Genetic Engineering: Principles 
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and Methods, Vols. 1-4, Plenum Press, New York, and Ausubel et aL (1992) Current 
Protocols in Molecular Biology, Greene/Wiley, New York, N.Y.. 

PCR reactions can be carried out with a variety of thermostable 
enzymes including but not limited to AmpliTaq, AmpliTaq Gold, or Vent polymerase. 

5 For AmpliTaq, reactions can be carried out in 10 mM Tris-Cl, pH 8.3, 2.0 mM 
MgCl2, 200 jxM of each dNTP, 50 mM KC1, 0.2 fiM of each primer, 10 ng of DNA 
template, 0.05 units//*l of AmpliTaq. The reactions are heated at 95°C for 3 minutes 
and then cycled 35 times using suitable cycling parameters, including, but not limited 
to, 95°C, 20 seconds, 62°C, 20 seconds, 72°C, 3 minutes. In addition to these 

10 conditions, a variety of suitable PCR protocols can be found in PCR Primer. A 
Laboratory Manual , edited by C.W. Dieffenbach and G.S. Dveksler, 1995, Cold 
Spring Harbor Laboratory Press; or PCR Protocols: A Guide to Methods and 
Applications , Michael et aL, eds., 1990, Academic Press. 

It is desirable to sequence the DNA encoding the fusion proteins, or at 

15 least the junction regions of the various portions (APP, transcription factor, linkers) of 
the fusion protein in order to verify that the desired portions have in fact been 
obtained, joined properly, and that no unexpected changes have been introduced into 
the sequences by the PCR reactions. 

Suitable PCR primers for amplification of DNA sequences for use in 

20 the present invention can be readily designed by those of skill in the art. Examples of 
such primers are shown below. 

5'-GGA GAG GAT ATC ATG GAG CCA GTA GAT CC-3' (SEQ ID NO:53) can be 
used to amplify the 5' portion of IDV-1 TAT exon L 

25 

5'-TAC ATG GCG GCC GCC TAC TTA CTG CTT TG-3' (SEQ ID NO:54) can be 
used to amplify the 3' portion of EDV-1 TAT exon I. 

5'-GGA TGT GAT ATC TTT CTT CTT CAG CAT CAC CAA GG-3' (SEQ ID 
30 NO:55) can be used to amplify the 3' portion of DNA encoding amino acids 1-651 of 
APP, i.e., the transmembrane region of APP. 

The following non-limiting examples are presented to better illustrate 

the invention. 



-32- 



WO 03/072041 



PCT/US03/05458 



EXAMPLE 1 

Transfection of pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI with 
pMM321 _ 

5 The following example demonstrated that an APP/TAT fusion 

construct will transactivate a reporter gene in which the HIV1 LTR regulatory DNA 
sequence controls the expression of enhanced green fluorescent protein (EGFP). The 
following also serves as an example of the kind of preliminary routine variations of 
fusion protein levels and inhibitor levels that may be advantageous to test in the 

10 practice of the present invention. Such routine variations are often helpful in 
validating the assays before a large scale screening project is undertaken. 

The APP/TAT fusion construct is referred to as "pcDNA3.1 zeo (+) 
APP(1-651)SW, K612V-(MlL)TATexonT (see Figure 26) and contains the HTV1 
TAT exon 1 fused just after the transmembrane domain of APP. This construct is also 

15 shown in outline form in Figure IB. "pMM321" refers to a reporter gene plasmid 
consisting of the HIV1 LTR driving the transcription of enhanced green fluorescent 
protein (see Figure 25). As a positive control for TAT expression, a construct in 
which TAT was under the control of a strong, constitutive promoter ( referred to as 
"pUCdSTAT"; see Figure 24) was used. 

20 

METHODS: 

1. Day 1: Pass HEK 293T cells into 2x6 well dishes at 1 x 105 cells/well. 

2. Day 2: Transfect cells with 9 ]iL Fugene and 0.125 (xg pMM321 and various 
25 amounts of pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI. 

Plate 1: 1. 5 \ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI 

2. 2.5 \ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI 

30 3. 1.25 fig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 

(MlL)TATexonI 



-33- 



WO 03/072041 



PCT/US03/05458 



4. 0.625 Hg pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI 

5. 0.312 fig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 

(MlL)TATexonI 

5 6. 0.156 ug pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 

(MlL)TATexonI 

Plate 2: 1. 0.08 ug pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI 

2. 0.04 ug pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
10 (MlL)TATexonI 

3. no pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI 

4. 0.625 pUCdSTAT 

5. 0.625 jig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI and 1 ug pMM321 

15 6. 1 ug pMM321 

Six hours post-transfection, green cells were only observed in plate 2, #5. 

3. Day 3: The fluorescence intensity of the transfected cells was observed and 
20 recorded. 

4. Day 4: The fluorescence intensity of transfected cells was observed and recorded. 
RESULTS: 

25 Co-transfection with pcDNA3. 1 zeo (+) APP(l-65 1)SW, K612V-(MlL)TATexonI 
increased GFP expression in the cells. 

Day 3: 

• 5 ug - no green cells 

30 • 2.5 ug - no green cells (too much DNA for these two transfections?) 

• 1.25 ug - many bright and dim green cells (see photographs and figure in ancillary 
data) 

• 0.625 ug - bright and dim green cells but fewer than at 1.25 ug 

• 0.312 ug - no difference obvious between 0.625 ug and 0.312 |Xg 
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• 0. 156 \ig - very few green cells 

• 0.08 fig - very few green cells 

• 0.04 \ig - very few green cells 

• no pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI - extremely few 
5 (if any) green cells 

• 0.625 (ig pUCd5TAT - cells were extremely bright, not necessarily more in 
number than with pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI 

• 0.625 \ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI + 1 jig 
pMM321 - many bright and dim green cells. 

10 • 1 \xg pMM321 many-fold fewer green cells, some bright, most dim. 

Day 3 - changed media (saved 1 mL conditioned media from wells Plate 1- 3, 4, 5, 6; 
Plate 2 - 1, 2, 3, 5, 6). Added fresh media with 10 jiM of 1^685,458 (a potent, cell 
permeable y- secretase inhibitor) to wells Plate 1-3,4, 5, 6; Plate 2 - 3, 4, 5, 6. 
15 Waited 48 hours to observe loss of fluorescence since GFP is so stable. 

After 48 hours, all wells appeared brighter than at 24 hour time point. This does not 
necessarily mean that the inhibitor was ineffective, or that the assay did not work, 
since there were no controls run where the inhibitor was not added. However, it does 
20 suggest that under these conditions it may be preferable to add the inhibitor at the time 
of transfection to shut down y-secretase as soon as possible and avoid release of TAT 
and induction of GFP. 

EXAMPLE 2 

25 Transfection of APP(1-651)SW, K612V-(MlL)TATexonI into HEK293T and H4 
cells accompanied bv inhibition of y-secretase activity with L-685,458 

The following example demonstrates the operation of the invention in 
HEK293T cells and H4 cells and shows inhibition of APP processing (and thus TAT 
release) by treatment with a known y-secretase inhibitor. "pcDNA3.1 zeo (+) APP(1- 
30 651)SW, K612V-(MlL)TATexonI," "pMM321," and "pUCd5TAT" are the same as 
in Example 1. H4 cells (ATCC HTB-148) are a neuronal cell line. 
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METHODS: 

Dayl: Plated out 2 x 6 well plates of HEK293T cells and 2 x 6 well plates of H4 cells 
at 1x105 cells/well. 

5 Day 2: Transfected cells with 2 \ig total DNA - pcDNA3.1 zeo (+) APP(1-651)SW, 
K612V-(MlL)TATexonI and carrier (a pET-IN plasmid). 
Platel: 

1,2: 1 |xg pMM321 + 1 fxg pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI 

10 3,4: 1 jig pMM321 + 0.1 |Xg pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI + 0.9 \ig carrier 

5: 1 |Xg pMM321 -h 1 fxg carrier (added too much carrier to this well in H4 cells) 
6: 1 (xgpMM321+0.1fXgpUCd5TAT + 0.9 fxg carrier 

15 Plate 2: 

1,2: 0.1 \ig pMM321 + 1 \xg pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI + 0.9 ]ig carrier 

3,4: 0.1 \ig pMM321 + 0.1 fig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI + 
20 1.8 jmg carrier (added too 0.5X carrier to this mix in 293T cells) 
5: 0.1 \ig pMM321 + 1.9 |Lig carrier 
6. 0.1 |Xg pMM321 + 0.1 jig pUCdSTAT + 1.8 fxg carrier 

Transfections for HEK293T cells: 9 pL Fugene/well. Combined with DNA in 
25 Optimem and incubated and added to cells according to manufacturer's instructions. 

Transfections for H4 cells: 6 \iL Fugene/well. Combined with DNA in Optimem and 
incubated and added to cells according to manufacturer's instructions. 

30 Added 10 yM L-685,458 to Plates 1 and 2, wells 2 and 4 for both cell types within 1 
hour of transfection. Observed cells periodically. 

Took pictures at 24, 46 hours after transfection, using AE lock to keep exposures 
constant between wells. 
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RESULTS: 

Both H4 and 293T cells turned much brighter green in the presence of pcDNA3.1 zeo 
(+) APP(l-651)SW,K612V-(MlL)TATexonI 
5 At 24 hours: 
H4 cells: 

Plate 1: lugpMM321 

1. 1 ug pMM321 + 1 ug pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI: Many 

10 bright and dim green cells (good transfection efficiency as well) 

2. 1 ug pMM321 + 1 ug pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI + 10 yM 1^685,458: Also many bright and dim green cells, but 
reduced compared with 

well#l 

15 3. Very few green cells (a few per field) 

4. Very few green cells 

5. A few dimly green cells 

6. Some induction with 0.1 \ig pUCdSTAT but still relatively few cells. 

20 Plate 2: 0.1 |XgpMM321 

1. 0.1 jxg pMM321 + 1 |LLg pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI: 

-10 bright green cells/field and the rest are dim green 

2. 0.1 jig pMM321 + 1 \ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 

25 (MlL)TATexonI + 10 \xM L-685,458: -3-5 bright green cells/field, some dim green, 
and some not green. 

3. No green cells 

4. No green cells 

5. No green cells 

30 6. A few bright green cells 

HEK293T cells: 
Plate 1: l^gpMM321 
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1. + 1 |xg pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI: of 15 cells: 
6 dim, 4 

medium, 5 bright 

2. + 1 jxg pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI + 10 \M Lr 
5 685,458: 

of 15 cells: 6 very dim, 5 dim, 1 medium, 3 bright 

3. +0.1 lig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI: many more 
green 

cells than 1 pig, lots of strong, bright green cells 
10 4. + 0.1 \ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI + 10 ^M Lr 
685,458: 

Fewer bright green cells/field but intensity does not appear strongly diminished 

5. 1 |xg pMM321 alone: Most cells in the field expressing dim to medium levels of 
GFP 

15 6. enhancement by 0. 1 |Xg pUCdSTAT 
Plate 2: 0.1fAgpMM321 

1. + 1 \ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI: Bright and 
medium 

20 green cells 

2. + 1 iig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI + 10 L- 
685,458: 

Bright, medium, and dim green cells 

3. + 0.1 iig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI: Bright, 
25 medium, 

and dim green cells 

4. +0.1 jig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI + 10 pML- 
685,458: 

Bright, medium, and dim green cells 
30 5. 0.1 jxg pMM321 alone: Most expressing cells have dim GFP, a few medium to 
bright 
cells 

6. Enhancement by 0.1 fxgpUCd5TAT 
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Changed media on cells at 24 hours past transfection. Kept 10 \xM 1^685,458 on cells 
in wells 2 and 4. 

At 46 hours after transfection, examined the wells again. Lots of floating cells in all 
5 wells, all cell types. Highest number of floaters in 1 \ig pcDNA3.1 zeo (+) APP(1- 
651)SW, K612V-(MlL)TATexonI lanes. 

Took some photographs under fluorescent and white light (white light at low 
intensity) to reveal fluorescent and non-fluorescent cells. Conducted a subjective 
10 analysis of the photographs to see if the amount of inhibition by 10 (xM L-685,458 
was in any way quantifiable. Counted bright (white in the middle), strong (blue 
middle), medium (green) and dim/non-fluorescent cells and determined the 
approximate fraction of each level of expression. Results follow: 

15 TABLE 1 

293T cells transfected with X jxg pMM321 (first number in left column) and X jig 
pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI (second number in left 



column) 



Transfection 


# Bright 


# Strong 


#Med 


#Non 


% Bright 


%Strong 


% Med 


%Non 


lue + iws 


none 


8 


81 


316 




2 


20 


78 


1 + 1 + cmpd 


none 


5 


66 


521 




0.8 


11 


88 


1 + 0.1 


41 


77 


143 


61 


12 


24 


44 


19 


1+0.1+cmpd 


23 


32 


149 


194 











20 The results shown in Table 1 indicate that the presence of L-685,458 ("cmpd") caused 
fewer strong and medium fluorescing cells as well as more non-fluorescent cells in the 
first run; in the second run, L-685,458 caused fewer bright and strong fluorescing cells 
as well as more non-fluorescent cells (with slightly more medium fluorescing cells). 
Overall, these data clearly indicate that the presence of an inhibitor of APP processing 

25 such as L-685,458 can be identified by the present invention. 

1 mL of conditioned media from each well was analyzed for production of Ap. 
Higher than background levels of AP were observed in 293T cells after transfection 
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with 1 jig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI and 0.1 |ig 
pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI and higher than 
background levels of A(3 in H4 cells after transfection with 1 \xg pcDNA3.1 zeo (+) 
APP(1-651)SW, K612V-(MlL)TATexonI, but not 0.1 \ig pcDNA3.1 zeo (+) APP(1- 
5 65 1)SW, K6 12V-(MlL)TATexonI (no enhancement of GFP was observed with 0. 1 
\ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI either). A£ was 
completely inhibited to background levels by 10 \jM L-685,458. Surprisingly, 
substantial inhibition of GFP was not observed with 10 |iM L-685,458. 

10 100,000 cells from each well were trypsinized and placed in 0. 1 mL phenol red-free 
media in a Costar 96-well dish and read using the fluorometer. The results are shown 
below: 

TABLE 2 



A 






lugAPPTAT 


10uM458 


O.lugAPPtat 


10uM458 


no tat 


O.lug 

pUCd5TAT 










B 


293T 


lug 

LTRGFP 


17670 


14321 


65535 


65535 


14890 


65535 










C 




O.lug 
LTRGFP 


9976 


10491 


17677 


14790 


9624 


25735 










D 


H4 


lug 

LTRGFP 


21307 


25307 


7175 


7136 


7147 


7277 










E 




O.lug 
LTRGFP 


9574 


10031 


7317 


6957 


6946 


7247 










F 




Blank 


7498 


7124 


7570 


7454 


5774 


7638 










G 


























H 




























1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 



15 In Table 2, "APPtat" is pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI. 
"458" is L-685,458. "LTRGFP" is pMM321. 
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0.1 pig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI with and without 
compound exceeded the maximum reading of the fluorometer, as did the addition of 
0.1 Jig pUCd5TAT to cells transfected with 1 \ig pMM321. 
1 \xg pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI + 1 \ig pMM321 
5 incrementally increased the amount of fluorescence relative to 1 \ig pMM321 alone, 
and this was reduced to background levels by 10 1^685,458. Inhibition of 
fluorescence was also observed in 293T cells transfected with 0.1 \ig pcDNA3.1 zeo 
(+) APP(1-651)SW, K612V-(MlL)TATexonI + 0.1 \ig pMM321. No inhibition of 
fluorescence was observed in H4 cells under any transfection conditions. 

10 

EXAMPLE 3 
Use of APPf 1-65 nSW. K612V-TATexonI in H4 cells 

L-875,532 is a known y-secretase inhibitor having the structure shown below. It is 
described and details of its synthesis are disclosed in Seiffert et al., 2000, J. Biol. 
15 Chem. 275:34086-34091. 




L-875532 
Compound X is a (3-secretase inhibitor. 

20 

pRBR186 (Figure 22 A) is an expression vector containing DNA sequences encoding 
full-length APP containing the Swedish mutation and the K612V mutation. pRBR186 
does not contain a transcription factor fused to the APP sequences. 
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pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonIis an expression vector 
that directs the expression of a the fusion protein APP(1-651)SW, K612V- 
(MlL)TATexonI in mammalian cells. This fusion protein contains the first 651 
amino acids of APP (with a Swedish version of the (3-secretase cleavage site as well 
5 as the K612V mutation) fused in frame to exon I of HIV1 TAT, which has been 
modified with a Metl-Leu mutation. A schematic diagram of pcDNA3.1 zeo (+) 
APP(1-651)SW, K612V-(MlL)TATexonI is shown in Figure 26A. The nucleotide 
sequence of pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI (SEQ ID 
NO:22) is shown in Figure 26B-G. 

10 

METHODS: 

1. H4 cells (ATCC HTB-148) were transfected with the various constructs listed 
below using 6 \\L Fugene per 100 \xL Optimem and 100 \xL Optimem per well (6-well 
dishes). Transfection reactions were incubated for 30 minutes prior to adding 100 \xL 
15 dropwise onto wells. 

Transfections were done as follows: 

1. 1 fig pMM321 (Figure 25A-D) and 1 \ig pcDNA3.1 backbone 

20 la. 1 jLtg pMM321 and 1 jjig pcDNA3.1 (Invitrogen, San Diego, CA) backbone. Prior 
to 

transfection, 10 pM L-875,532 (y-secretase inhibitor) was added to the well. 

2. 1 |utg pMM321 and 1 |Ltg pRBR186 (Figure 22A; APP expression vector; 
processing 

25 and inhibition of processing control) 

2a. 1 (Xg pMM321 and 1 |Xg pRBR186. Prior to transfection, 10 (iM L-875,532 was 

added to cells 
(transfection solution for 3-5 were prepared in bulk) 

3. and 3a. 1 \ig pMM321 and 1 fig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
30 (MlL)TATexonI 

4. and 4a. l\ig pMM321 and 1 \ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI. Prior to transfection, 10 \xM L-875,532 was added to the two 

wells. 

5. and 5a. 1 [xg pMM321 and 1 \ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
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(MlL)TATexonI. Prior to transfection, 10 uM Compound X was added to the two 
wells 

6. 1 ug pMM321 and 1 ug pUCd5TAT (Figure 24). 

7. 1 ug pMM321 and 1 ug pUCd5 TAT. Prior to transfection, 10 uM L-875,532 was 
5 added to the cells. 

RESULTS: 

Cells were assessed by eye under a fluorescence microscope the morning following 
transfection (-20 hrs). 
10 1 and la, 2 and 2a. Weak fluorescence 

3 and 3 a. Much stronger fluorescence 

4 and 4a. Clear inhibition of fluorescence 

5 and 5a. Possible inhibition of fluorescence, but doesn't look that great 

6 and 7. Almost blindingly fluorescent. 

15 

At approximately 48 hours, cells were trypsinized, spun down, and resuspended in 
100 uL PBS. The cellular contents of each well of the transfection plates were placed 
into one well of a 96-well fluorescence plate. Fluorescence was analyzed using the 
FLUOstar (485 excitation/538 emission). The results are shown in Table 3. 

20 

TABLE 3 



Transient transfections in H4 cells Fluor Units 

pMM321 4484 

pMM321+ L-875,532 3443 

P MM321+pRBR186 2735 

pMM321 + pRBR186 + L-875,532 2161 

pMM321 + APP-TAT-ct32 20177 

pMM321 + APP-TAT-ct32 + L-875,532 8283 

pMM321 + APP-TAT-ct32 + Compound X 1 1946 

pMM321 + pucd5-TAT 61 102 



In Table 3, "APP-TAT-ct32" refers to pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI. 
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For a graphical presentation of these results, see Figure 19. In Figure 19, "UTR-GEP" 
refers to pMM321 ; "APP-TAT-ct32" refers to pcDNA3. 1 zeo (+) APP(l-65 1)SW, 
K612V-(MlL)TATexonL Compare the bar labeled "LTR-GFP + APP-TAT-ct32" 
5 with the bars labeled "LTR-GFP + APP-TAT-ct32 + L-875,532" and "LTR-GFP + 
APP-TAT-ct32 + Compound X." Inhibition by both the p-secretase inhibitor 
(Compound X) and the y-secretase inhibitor (L-875,532) is easily identified by the 
present invention. 

10 Conclusions: 

• The data indicate that the expression of the fusion protein APP(1-651)SW, K612V- 
(MlL)TATexonI enhances transactivation through the LTR of pMM321 in a manner 
that depends on APP processing. 

• APP(1-651)SW, K612V-(MlL)TATexonI expressing cells were 6X brighter than 
15 pMM321 cells alone. 

• Treatment with L-875,532 decreased fluorescence 2.5X. 

• Treatment with Compound X decreased fluorescence 1.7X. 

• Expression of TAT via pucd5-TAT was almost blinding and was 19X above 
pMM321 alone, indicating that APP(1-651)SW, K612V-(MlL)TATexonI expression 

20 did not lead to levels of GFP as high as TAT alone. Despite the decreased activation 
shown by TAT when provided by the fusion protein, as compared with TAT driven by 
the AMLP (adenovirus major late promoter) in pucd5-TAT, the assay was easily able 
to identify both the P-secretase and the y-secretase inhibitors. 

• Control plasmids (pMM321 and pMM321 + pRBR186) were dimly fluorescent and 
25 were not inhibited by L-875,532. 

A lower level of inhibition by the P-secretase inhibitor is to be expected since the 
K612V mutation decreases alpha-secretase activity by 95% and thus some alpha- 
secretase cleavage is to be expected. 

30 
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EXAMPLE 4 

Construction of pcDNA3.1 (+) zeo APP(1-651)SW, K612V, GAL4-VP16(M1L) APP 
f664-695) 

1. The GAL4-VP16 insert was prepared by PCR from pCR2.1 GAL4VP16 (Figure 
5 20) (Invitrogen, San Diego, CA). The PCR was performed to eliminate the N- 

terminal methionine by changing this methionine into a leucine. 

40 ng pCR2.1 GAL4-VP16 
0.2 ML GAL4VP16 5' oligo at 250 uM: 
10 5'-CTG AGATATC AAGCTACTGTCTTCTATCGAACAAGC-3' (SEQ ID NO:56) 

EcoRV site underUned 

0.2 uL GAL4VP16 3' oligo (at 250 pM): 5'- 

GCGCGATATCCCCACCGTACTCGTCAATTCC-3' (SEQ ID NO:57) 
EcoRV site underlined 
15 5 UJL10X Buffer 
8pL25mMMgCl2 
4nLPCRdNTPs 
0.25 (iL AmphTaq Gold 
27.35 uL water 

20 

Cycle: 

Purified reactions using a Qiaquick column 
Digested entire reaction using EcoRV 
25 Ran the DNA on a 1 % gel. Excised the band and purified using a QiaQuick gel 
purification kit 

2. Digested pcDNA3.1 APP(l-651)/APP(664-695) with EcoRV and SAP treated. 
pcDNA3.1 APP(l-651)/APP(664-695) is an intermediate plasmid formed in the 

30 procedure described in Example 6. pcDNA3.1 APP(l-651)/APP(664-695) the first 
651 amino acids of APP (with a Swedish version of the 0-secretase cleavage site as 
well as the K612V mutation) fused in frame to the last 32 amino acids of APP. 
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3. Ligated pcDNA3.1 APP(l-651)/APP(664-695) -EcoRV digested to GAL4VP16 
(EcoRV digested) 

5 4. At this point, it was realized that the 3' PCR primer for GAL4-VP16 put the 
APP(664-695) fragment out of frame. The APP(664-695) fragment was then re- 
PCR'd using the following protocol: 

1 MLpcDNA3.1 APP(l-651)-Gal4VP16-APP(664-695) 
10 50 nM APP NotI 5'ct32 in frame with GAL4-VP16 
50nMAPPNoti3' ct32(5' 

(p)CTGCTGT GGCGGCCGC CTAGTTCTGCATCTGCTC) (SEQID NO:58) 
NotI site underlined 

1 fjL PCR dNTPs (10 mM each dNTP, Roche) 
15 5 \xL 10X Expand Buffer with MgCl2 

40 jiL water 

1 fxL Expand polymerase (Roche) 

The PCR fragment was run on a 4% agarose gel and gel-purified using a QiaQuick gel 
20 purification column 

The fragment was digested with NotI and purified using a QiaQuick PCR purification 
column. 

5. APP(l-651)-Gal4VP16-APP(664-695) was re-miniprepped. Miniprep #1 was 
25 digested with NotI, run on a 1% gel, the upper band was then isolated and SAP- 
treated. 

6. APP(l-651)-Gal4VP16/NotI digested/SAP-treated was ligated to APP(664-695). 
30 7. Minipreps containing inserts were sequenced to verify the orientation of the insert. 
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EXAMPLES 

Construction of pcDNA3.1 zeo (+) APP(l-651)wt, K612V-TATexonI(MlL) 
APP(664-695) 

This procedure replaced a fragment of APP in pcDNA3.1 zeo (+) APP(1-651)SW, 
5 K612V-TATexonI(MlL) APP (664-695) that contained the Swedish mutation with a 
corresponding fragment from pRBR121 containing the wild-type P-secretase cleavage 
site rather than the Swedish P-secretase cleavage site. 

1. pcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
10 (Figure 21B) was digested with SnaBI and then EcoRI 

17 ixL miniprep DNA 
2 uLlOX buffer 
1 pL SnaBI (NEB) 

15 Digest was purified using Qiaquick PCR purification kit. Entire digest was then 
cleaved with EcoRI for 2 hours. 

2. pRBR121 (Figure 21 A) was digested with SnaBI and then EcoRI 

5]agpRBR121 
20 5 ML10X buffer 

2.5 \xL SnaBI (NEB) 
q.s. 50 \xL with water 

The digest was purified using a Qiaquick PCR purification kit. The entire digest was 
25 then cleaved with EcoRI for 2 hours . 

3. Both digests were run out on a 1% agarose gel. From the pRBR121 lane, the 2.4 
kb SnaBI-EcoRI fragment containing the wild-type P-secretase cleavage site was 
isolated. 
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From pcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
digest, BOTH the 5 kb SnaBI-EcoRI backbone fragment AND the 200 bp EcoRI- 
EcoRI fragment were isolated (see Figure 21B). 

5 4. A three-part ligation using equal molar ratios of the three fragments was carried 
out: 

The assumption was made that, since the starting plasmids were of similar sizes and 
the same amount was digested for each plasmid, the recovered fragments would be in 
approximately equal molar ratios. 

10 

Vector alone: 

l|iL APP-TAT-ct32 SnaBI/EcoRI 5Kb fragment 
7 |llL water 
2 |iL5X buffer 
15 10 \xL 2X buffer (Roche rapid ligation kit) 
lMLT41igase 

1+1+1 

1 (iLpcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
20 SnaBI/EcoRI 5KB fragment 

1 nL pRBR121 SnaBI/EcoRI 2.4 Kb insert 

1 mL pcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
EcoRMBcoRI 200 bp insert 
5 |iL water 
25 2 mL 5X buffer 
10nL2X buffer 
1 jLiLT41igase 

1+1+. . . 1 (in this 3-pt ligation, the ligation of two of the fragments together was done 
30 1 st , then the third fragment was added) 

1 MLpcDNA3.1 Z eo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
SnaBI/EcoRI 5Kb backbone 

1 MLpcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
EcoRl/EcoRI 200 bp insert 



-48- 



WO 03/072041 PCT/US03/05458 



5 jiL water 
2\xL5X buffer 
10 ML 2X buffer 
1 jiLT41igase 
5 waited 5 minutes 

then added lyL pRBR121 SnaBI/EcoRI 2.4 kb insert 

1+1+3 

1 mL pcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
10 SnaBI/EcoRI 5 kb backbone 

1 pRBR 121 SnaBI/EcoRI 2.4 kb insert 

3 MLpcDNA3.1 ze0 (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
EcoRI/EcoRI 200 bp insert 
3 jjL water 
15 2 ML 5X buffer 
10 ML 2X buffer 
lMLT41igase 

Transformed and plated out 200 ML. The number of colonies in the vector + insert 
20 ligations far exceeded the number of colonies in the vector alone ligation. Picked 12 
colonies from 1+ 1+...1. 
Picked 6 colonies from 1+3. 
Miniprepped 

Digested with EcoRI to ensure that small 200 bp fragment was incorporated. 
25 RESULTS 

Minipreps #10 and 15 contained 200 bp EcoRI fragment. 

Oriented with Bam HI digestion. 

30 Sequenced with sAPPb F2 and F3 primers. Miniprep #15 contains both inserts in the 
correct orientation. 
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EXAMPLE 6 

Construction of pcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) 
APP(664-695) 

1. PCR of APP (664-695): 

5 The starting material was the pRBR186 plasmid (Figure 22A). 

PCR of APP (664-695) 
lngpKBR186 

50 nM 5* oligo (5 , -TGCCCCGCGCGGCCGCGCGATGCTGCCCGG-3 , ) (SEQ ID 
10 NO:59) NotI site underlined 

50 nM 3' oligo (5 ' -(p) ATGGTGTGGCGGCCGC AGACGCCGCTGTC ACC-3 ' ) 

(SEQ ID NO:60) NotI site underlined 

1 \xL Roche PCR nucleotides 

5 yL 10X Expand buffer 
15 40 iHL water 

1 juL Expand 

Cycle: (94°C, 5 min) - 25X (94°C, 30 sec; 42°C, 1 min; 72°C, 2 min) - 72°C x 6 min 
-4°C hold. 

20 

The -100 bp fragment was gel purified (4% agarose, IX TBE gel) 

The gel-purified fragment was ligated into NotI digested, Shrimp Alkaline 
Phosphatase-treated pcDNA3.1 zeo (+) (Invitrogen). The presence of the insert and 
25 its orientation was confirmed by sequencing. 

2. PCR of APP(1-651): 
1 ngpRBR186 

50 nM 5' oligo (5'-(p)AGCGCACAAGCTTCCCCGCGCAGGGTCGCGATGCTG- 
30 3') (SEQ ID NO:61) ffindEI site underlined, Met(l) ATG of APP in bold 

50nM3' oligo (5 7 -(p)GG ATG TAAGCTT TTTCTTCTTC AGC ATC A 9 ) 

(SEQ ID NO:62) Hindin site is underlined 
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1 \xL Roche PCR nucleotides 
5 jjlL 10X Expand buffer 
40 |JL water 
1 pL Expand 

5 

Cycle: (94°C, 5 min) ~ 25X (94°C, 30 sec; 37°C, 1 min; 72°C 2.5 min) - 72°C x 6 
min - 4°C hold 

• The amplified fragment was isolated on an agarose gel. The fragment was 

10 purified from the gel using Qiaquick Gel purification columns. The fragment was 
digested with HindHI. The amount of the fragment was too small to subclone, so 
the PCR was repeated using 1 \xL of the amplified fragment and carrying out 5 
reactions simultaneously. 

15 • The fragments were purified from these reactions using a QiaQuick PCR 

purification kit. The fragments were eluted in 30 |iL and digested with HindHI for 
2 hours. The digested fragments were then gel purified. 

• The purified fragments were ligated to pcDNA3.1 zeo (+) APP(664-695) that had 
20 been digested with HindHI and SAP treated. This gave the intermediate plasmid 

pcDNA3.1 zeo (+) APP(l-651)/APP(664-695). 

3. PCR of (MIL) TAT: 

The starting material was NL4-3 viral plasmid (Figure 22B). 

25 

PCR reaction: 

1 ng NL4-3 viral plasmid 

50 nM TAT 5' RV Met-Leu PCR primer (5'- 

TGCAGATATCCTGGAGCCAGTAGATCCTAGAC-3 ') (SEQ ID NO:63) 
30 EcoRV site underlined, Met-Leu mutation in bold 
50 nM TAT 3' RV Met-Leu PCR primer 

(5 ' -GCTGGATA^CTCTGCTTTGATAGAGAAGC-3 ' ) (SEQ ID NO:64) 
EcoRV site underlined 
1 ML PCR dNTPs 
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5 \*L PCR 10X buffer with MgC12 
40 \jL water 

1 )iL Expand polymerase 

5 Cycle: 

94°C for 5 miii 

[30 sec 94°C, 1 min 42°C, 1 min 72°C] x 25 cycles 

5 min at 72C 

holdat4°C 

10 

• The insert was purified over QiaQuick PCR purification column 

• The entire reaction was digested with 30 units EcoRV for 3 hours 

• The -200 bp insert was gel purified. 

• pcDNA3.1 zeo (+) APP(l-651)/APP(664-695) was digested with EcoRV, and 
15 then SAP treated 

• The Metl-Leu TAT fragment was ligated to pcDNA3.1 zeo (+) APP(1- 
651)/APP(664-695). 

A map of the resulting plasmid is shown in Figure 22C. 

20 EXAMPLE 7 

Design of novel expression vector for expression of APP(1-651)SW, K612V- 
TATexonKMlD APPf 664-695) 

PURPOSE: To provide a low level expression of APP(1-651)SW, K612V- 
TATexonI(MlL) APP(664-695), a prokaryotic selectable marker that is NOT 
25 ampicillin (read-through of the P-lactamase gene is sometimes a problem), and a 
eukaryotic selectable marker that is NOT zeocin (zeo is the marker for the reporter 
plasmids in some embodiments). 

METHODS 

30 L The dEYFP gene was removed from pd2EYFP (Clontech, Palo Alto, CA) using 
Bamlfl and Notl. The 5' overhangs was filled in using Klenow, and the plasmid was 
re-circularized 
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pd2EYFP plasmid was digested with BamHI, Notl. 

Ran reaction on 1% agarose gel. Digestion was complete. Cut out 3.4 kb band. 
Purified using Qiagen Gel Extraction Kit. 
5 Klenow fill-in: 

~4 (xg plasmid backbone 
7.5 \xL NEB buffer 2 

33 each dNTP (diluted from Roche PCR dNTPs) 
water to 75 |iL 
10 4 fiL Roche Klenow fragment (4 units) 

Incubated at room temperature for 15 minutes 
Heat inactivated at 70°C 

15 Took 1 jllL fill-in reaction. 
Diluted to 8 jiL with water 
Added 2 pL 5X DNA buffer 
Added 10 |aL 2X ligation Buffer 
Added 1 nL T4 DNA ligase 

20 

Incubated at room temperature 

Transformed 2 |iL ligation into Invitrogen maximum efficiency DHSalpha competent 
cells, 

25 

Plated out on Kanamycin plates. Lots of colonies. 

2. The RSV promoter from pREP4 (Invitrogen) was excised using BgUI and Hindm 
and cloned into the re-circularized plasmid. 

30 

Digested 5 \xg pREP4 with Hindm 
Purified using Qiagen PCR purification kit 
Digested with BglH. 



-53- 



WO 03/072041 



PCT/US03/05458 



The RS V promoter fragment was gel purified and cloned into the re-circularized 
plasmid. 

3. The resulting expression vector (pRSV Kan/Neo res; Figure 23) has the eukaryotic 
5 RSV promoter 5' to the pd2EYFP polylinker, SV40 driving neo and kanamycin 

prokaryotic selection, and a pUC ori for high levels of replication in bacteria. 

EXAMPLE 8 

Use of APP(1~651)SW, K612V-TATexonI(MlL) APP(664-695) in HeLa cells with a 
10 ft-galactosidase reporter gene 

The following demonstrates the practice of the present invention with 
the APP(1-651)SW, K612V-TATexonI(MlL) APP(664-695) fusion protein (SEQ ID 
NO:2) and P-galactosidase as a reporter gene. P4-R5 cells are HeLa cells that contain 
a stably integrated p-galactosidase reporter gene under the control of the HTV1 LTR. 

15 

Materials: 

1. ) Cells: P4-R5 cells 

2. ) DNA: 0.78 jxg/|jLpcDNA3.1 zeo (+)AFP(1-651)SW, K612V- 
TATexonI(MlL) APP(664-695) 

20 3.) Transfection reagents: FUGENE® 

4. ) Media: OPTIMEM® 

cDMEM (-)phenol red /10% FBS 

5. ) Compounds: Compound X (P-secretase inhibitor) 10 mM 

L-875,532 (y-secretase inhibitor) 10 mM 

25 

Protocol: 
Davl 

1.) Cell count on P4-R5 cells = 7.6 x 105 cells per ml in cDMEM (-)PR. 
Seeded sterile white luminometer TC plates with the following cell numbers: 
30 10 ml 

5 x 103/well = 0.75 ml in 9.25 ml media 
1.0 x 104/well = 1.5 ml in 8.5 ml media 
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Seeded 100 \xL cells per well. 
Incubate overnight at 37°C, 5% C02. 

Day 2 

2. ) Made up media with appropriate dilutions of inhibitors. 

5 On no-inhibitor controls, added 100 \\L of cDMEM with 1% DMSO 

On wells with Compound X, added 10 fiM inhibitor in cDMEM 
On wells with L-875,532, added 10 |jM inhibitor in cDMEM 

3. ) Prior to transfectibn, pulled off media on P4-R5 cells and replaced with media -/+ 
inhibitor. 

10 FUGENE® transfection: 

For FUGENE® transfection: 

4. ) Added 600 \xL of OPTIMEM® to sterile EPPENDORF® tube and carefully 
added 30 \\L FUGENE® to media, without touching walls of tube. Incubated at room 
temperature for 5 minutes. 

15 In separate EPPENDORF® tubes, added each DNA. 

Added FUGENE®/OPTTMEM® dropwise to DNA; incubated at room temperature 
for 15 minutes. 

Added 15 \xL /well of DNA/FUGEl^®/OFnMEM® dropwise to media in 
appropriate wells on P4-R5 cells, swirling to mix. 



TABLE 4 



Transfection 
number 


Cone of 
DNA 


Vol of DNA 


Vol of 
FUGENE® 


Vol of sterile 
OPTIMEM® 


APP-ML-Tat- 
APPct 


0.78 [i%J\iL 


5fig= 6.5 |JL 


30|iLof 
FUGENE® 


600 ixL of 
OPTIMEM® 


pUCdSTAT 


1.24 Hg/ML 


5^g = 4.0nL 


30|iLof 
FUGENE® 


600 \xL of 
OPTIMEM® 


p243-4 


0.56 jxg/jxL 


5 \xg = 9 mL 


30nLof 
FUGENE® 


600 \xL of 
OPTIMEM® 



In Table 4, "APP-ML-Tat-APPct" refers to pcDNA3.1 zeo (+)APP(1-651)SW, 
25 K612V-TATexonI(MlL) APP(664-695). "pUCd5TAT" is an expression vector that 
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serves as a positive control for TAT expression, since it is a construct in which TAT is 
under the control of a strong, constitutive promoter (see Figure 24). "p243-4" is a 
control expression vector that directs the expression of APP. 

5 5.) Plates were transferred to an incubator and incubated for 48 hours to allow 
expression and processing of the proteins. 
Day 4 

6.) The protocol below was followed for lysis of the cells and measurement of P- 
galactosidase in the cell lysates. 

10 

Measurement of P-galactosidase in lysates of transfected cells. 

1 . Removed TROPDC® chemiluminescence kit(s) from cold 
room, allowed to come to room temperature in a 37°C water bath. 

2. p-galactosidase standards were prepared: 

15 Made 1 :5000 dilution of P-galactosidase stock (1 mg/ml) in lysis buffer. Did 2 fold 
dilutions. 

3. Diluted TROPK® substrate 1:25 into buffer. (Made enough 
for 100 (XL /well). 

4. Added to reservoir and added 100 (ilVwell. 

20 5. Added 10 joL of P-galactosidase standards to column 12 on 

plate and incubated in dark for 1 hour. 

6. Read immediately in luminometer using standard file. Filled in 
required fields, read plate. 

25 The results are shown in Figure 33. Figure 33 demonstrates that the present invention 
was able to identify both the p-secretase inhibitor (Compound X) and the y-secretase 
inhibitor (L-875,532). In Figure 33, "APP-tat-ct32" refers to pcDNA3.1 zeo 
(+)APP(1-651)SW, K612V-TATexonI(MlL) APP(664-695). Although not indicated 
in Figure 33, the results for the controls were as expected: a large transactivation of 

30 the LTR by pUCdSTAT was observed which was not affected by either inhibitor. No 
transactivation was seen with p243-4. 
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EXAMPLE 9 

Comparison of the use of APP(1-651)SW, K612V-TATexonI(MlL) APP(664-695) 
and APP(l-651)wt, K612V-TATexonI(MlL) APP(664-695) with a p-galactosidase 
reporter gene 

5 The following shows a side-by-side comparison of the practice of the 

present invention with the APP(1-651)SW, K612V-TATexonI(MlL) APP(664-695) 
fusion protein (SEQ ID NO:2) and the APP(l-651)wt, K612V-TATexonI(MlL) 
APP(664-695) fusion protein (SEQ ID NO:4). P4-Pv5 cells are HeLa cells that contain 
a stably integrated P-galactosidase reporter gene under the control of the HTV1 LTR. 

10 

Materials: 

1. ) Cells: P4-R5 cells 

2. ) DNA: 0.78 \ig/\xL pcDNA3.1 neo (+) APP(1-651)SW, K612V- 
TATexonI(MlL) APP(664-695) 

15 0.812 fAg/nL pcDNA3.1neo(+)APP(l-651)wt,K612V- 

TATexonI(MlL) APP(664-695) 

1.24 ng/ML P UCd5TAT 
0.56 \LgJlxL p243-4 

3. ) Transfection reagents: FUGENE® 
20 4.) Media: OPT1MEM® 

cDMEM (-)phenol red /10% FBS 
5.) Compounds: Compound X (p-secretase inhibitor) 10 mM 
L-875,532 (Y-secretase inhibitor) 10 mM 

Dayl 

25 1 .) Cell count on P4-R5 cells = 5 x 105 cells per ml in cDMEM (-)PR. 

Seeded sterile white luminometer TC plates with the following cell numbers: 

10 ml 

5 x 103/weU = 4.0 ml in 36.0 ml media 
Diluted 1:1 into media and seeded one plate at 2.5 x 103/well. 
30 Seeded 100 fiL cells per well. 

Incubated overnight at 37°C, 5% CO2. 

Day 2 
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2. ) Made up media with appropriate dilutions of inhibitors. 

On no-inhibitor controls, added 100 pL of cDMEM with 1% DMSO 
On wells with Compound X, added titration curve from 100 \xM. 
5 inhibitor in cDMEM (-)PR. 

On wells with L-875,532, added titration curve from 100 fiM inhibitor 
in cDMEM (-)PR. 

3. ) Prior to transfection, pulled off media on P4-R5 cells and replaced with media -/+ 
10 inhibitor. 

FUGENE© transfection: 

4. ) Added volume of OPTIMEM® to sterile EPPENDORF® tube and carefully 
15 added correct volume of FUGENE® to media, without touching walls of tube. 

Incubated at room temperature for 5 minutes. 

In separate EPPENDORF® tubes, added each DNA, as outlined below. 
Added FUGENE®/OPTIMEM® dropwise to DNA; incubated at room 
temperature for 15 minutes. 
20 Added 15 Ml/well of DNA/FUGENE®/OPTIMEM® dropwise to media in 

appropriate wells on P4-R5 cells, swirling to mix. 



TABLES 



TraBsfection number 


Cone of 
DNA 


Vol of DNA 


Vol Of 
FUGENE® 


Vol of sterile 
OPTIMEM® 


l.)APP-ML-Tat- 
APPct (Sw) 


0.78 (ig/f-iL 


10ng= 13 pL 


60 pi of 

FUGENE® 


1200 pL of 
OPTIMEM® 


2.) APP-ML-Tat- 
APPct (WT) 


0.812 


10|*g=12.2 
pL 


60nLof 
FUGENE® 


1200 f-iL of 
OPTIMEM® 


3.) pUCd5TAT 




5jig = 4.0nL 


30nLof 
FUGENE® 


600 mL of 
OPTIMEM® 


4.)p243-4 


0.56 (ig/|oL 


5fig = 9nL 


30nLof 
FUGENE® 


600 ML of 
OPTIMEM® 
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In Table 5, "APP-ML-Tat-APPct (Sw)" refers to pcDNA3.1 neo (+) APP(1-651)SW, 
K612V-TATexonI(MlL) APP(664-695). "APP-ML-Tat-APPct (WT)" refers to 
pcDNA3.1 neo (+) APP(l-651)wt, K612V-TATexonI(MlL) APP(664-695). 
5 "pUCd5TAT" is an expression vector that serves as a positive control for TAT 
expression, since it is a construct in which TAT is under the control of a strong, 
constitutive promoter (see Figure 24). "p243-4" is a control expression vector that 
directs the expression of APP. 

10 5.) Plates were transferred to an incubator and incubated for 36 hours to allow 
expression and processing of proteins. 

Day 4 

15 6.) The protocol below was followed for lysis of the cells and measurement of P- 
galactosidase in the cell lysates. 

Measurement of B-galactosidase in lvsates of transfected cells. 

1. Removed TROPIX® chemiluminescence kit(s) from cold room, allowed to 
20 come to room temperature in a 37°C water bath. 

2. P-galactosidase standards were prepared: 

Made 1:5000 dilution of p-galactosidase stock (1 mg/ml) in lysis buffer. Did 2 
fold dilutions. 

3. Diluted TROPIX® substrate 1:25 into buffer. (Made enough for 120 fiL 
25 /well). 

4. Added to reservoir and added 120 pi/well. 

5. Added 10 juL of P-galactosidase standards to column 12 on plate and 
incubated in dark for 1 hour. 

6. Read immediately in luminometer using standard file. Filled in required 
30 fields, read plate. 

The results are shown in Figure 34. In Figure 34, "APP(NFEV)HAMycFLAG" refers 
to a protein that is a variant of APP in which NFEV is present at the P-secretase 
cleavage site and there are epitope tags in the amino terminal portion of the protein 
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but there is no transcription factor fused to APP. "APP(Sw)tat-ct32" refers to 
pcDNA3.1 neo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP(664-695). 
"APP(WT)tat-ct32" refers to pcDNA3.1 neo (+) APP(l-651)wt, K612V- 
TATexonI(MlL) APP(664-695). Figure 34 shows that the Swedish version and the 
5 wild-type version of APP appear to work about equally well in the assay. 

EXAMPLE 10 

L-685.458 

L-685,458 is a y-secretase inhibitor having the following structure: 




L-685,458 contains an hydroxyethylene dipeptide isostere and is 
thought to function as a transition state analog mimic of aspartyl proteases (Shearman 
et al., 2000, Biochemistry 39:8698-8704). L-685,458 was prepared as follows: 

15 {lS-Benzyl-4R-[l-(lS-carbamoyl-2-phenylethylcarbamoyl)-lS-3- 

methylbutylcarbamoyl]-2R-hydroxy-5-phenylpentyl}carbaixiic acid tert-butyl ester 
(L-685,458) was prepared by the coupling of 2R-benzyl-5S-tert- 
butoxycarbonylamino-4R-(tert-butyldimethylsilanyloxy)-6-phenylhexanoic acid 
(Evans et al., 1985, J. Org. Chem. 50:4615-4625) with Leu-Phe-NH2 followed by 

20 deprotection with tetrabutylammonium fluoride. The synthesis of { lS-benzyl-4R-[l- 
(lS-carbamoyl-2-phenylethylcarbam^ 

phenylpentyljcarbamic acid tert-butyl ester (L-682,679) has been described previously 
(De Solms et al., 1991, J. Med. Chem. 34:2852-2857). { lS-Benzyl-4R-[l-(lS- 
carbamoyl-2-phenylethylcarbamoyl)-lS-3-methylbutylcarbamoyl]-2-oxo-5- 
25 phenylpentyljcarbamic acid tert-butyl ester (L-684,414) was prepared by pyridinium 
dichromate-mediated oxidation of L-682,679. 



-60- 



WO 03/072041 



PCT/US03/05458 



The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in 
addition to those described herein will become apparent to those skilled in the art from 
5 the foregoing description. Such modifications are intended to fall within the scope of 
the appended claims. 

Various publications are cited herein, the disclosures of which are 
incoiporated by reference in their entireties. 
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WHAT IS CLAIMED IS: 

1 . A DNA molecule comprising a nucleotide sequence encoding a 
fusion protein comprising amino acids 589-651 selected from the group consisting of 

5 wild type APP695, the Swedish version of APP695 and the NPEV (SEQ ID NO:40) 
version of APP695 and a transcription factor where the transcription factor is fused in 
frame to the carboxyl terminus of amino acids 589-651. 

2. The DNA molecule of claim 1 where amino acids 589-651 
10 contain a K612V mutation. 

3. The DNA molecule of claim 1 where the nucleotide sequence 
further encodes amino acids 664-695 of APP695 wherein amino acids 664-695 are 
fused in frame to the carboxyl terminus of the transcription factor. 

15 

4. The DNA molecule of claim 1 where the transcription factor is 
selected from the group consisting of: HTV-l TAT, Gal4-VP16, the entire Gal4 
protein, LexA-VP16, E12, E47, Twist, Papillomavirus E2, EBV Zta, BIV TAT, HIV- 
2 TAT, or SIV TAT. 

20 

, 5. The DNA molecule of claim 3 where the fusion protein is 
selected from the group consisting of: APP(l-651)wt, K612V-TATexonI(MlL) APP 
(664-695) (SEQ ID NO:4); APP(l-651)wt, K612V, Gal4-VP16(M1L) APP (664-695) 
(SEQ ID NO:8); APP(l-651)wt, TATexonI(MlL) APP (664-695) (SEQ ID NO:12); 
25 and APP(l-651)wt, Gal4-VP16(M1L) APP (664-695) (SEQ ID NO: 16). 

6. The DNA molecule of claim 3 where the fusion protein is 
selected from the group consisting of: APP(1-651)SW, K612V-TATexonI(MlL) APP 
(664-695) (SEQ ID NO:2); APP(1-651)S W, K612V, Gal4-VP16(M1L) APP (664- 

30 695) (SEQ ID NO:6); APP(1-651)SW, TATexonI(MlL) APP (664-695) (SEQ ID 
NO:10); and APP(1-651)SW, Gal4-VP16(M1L) APP (664-695) (SEQ ID NO:14). 

7. The DNA molecule of claim 3 where the fusion protein is 
selected from the group consisting of: APP(1-651)NFEV, K612V-TATexonI(MlL) 
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APP (664-695) (SEQ ID NO:23) and APP(1-651)NEEV, K612V, GAI4-VP16(M1L) 
APP (664-695) (SEQ ID NO:25), 

8. An expression vector comprising the DNA molecule of claim 1 . 

5 

9. A eukaryotic cell comprising the DNA molecule of claim 1 . 

10. The cell of claim 9 further comprising a reporter gene where the 
reporter gene is under the control of a regulatory DNA sequence that is capable of 

10 being activated by the transcription factor. 

11. A method of identifying a substance that inhibits APP 
processing comprising: 

(a) providing a recombinant eukaryotic cell which: 

15 (i) expresses a fusion protein comprising amino acids 589- 

651 selected from the group consisting of wild type APP695, the Swedish version of 
APP695 and the NEEV (SEQ ID NO:40) version of APP695 and a transcription factor 
where the transcription factor is fused in frame to the carboxyl terminus of amino 
acids 589-651; and 

20 (ii) comprises a reporter gene operably linked to a 

regulatory DNA sequence which capable of being activated by the transcription factor; 

(b) measuring the level of reporter gene product in the cell in the 
absence of the substance; 

(c) adding the compound to the cell and measuring the level of 
25 reporter gene product in the cell in the presence of the substance; 

where a decrease in the level of reporter gene product in the presence 
as compared to the absence of the substance indicates that the substance inhibits APP 
processing. 

30 12. The method of claim 1 1 where amino acids 589-65 1 contain a 

K612V mutation. 

13. The method of claim 1 1 where the transcription factor is 
selected from the group consisting of: HTV-l TAT, Gal4-VP16, the entire Gal4 
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protein, LexA-VP16, E12, E47, Twist, PapiUomavirus E2, EBV Zta, BIV TAT, HIV- 
2 TAT, orSIVTAT. 

14. The method of claim 1 1 where the fusion protein is selected 
5 from the group consisting of: APP(l-651)wt, K612V-TATexonI(MlL) APP (664- 
695) (SEQ ID NO:4); APP(l-651)wt, K612V, Gal4-VP16(M1L) APP (664-695) 
(SEQ ID NO:8); APP(l-651)wt, TATexonI(MlL) APP (664-695) (SEQ ID NO: 12); 
and APP(l-651)wt, Gal4-VP16(M1L) APP (664-695) (SEQ ID NO: 16). 

10 15. The method of claim 1 1 where the fusion protein is selected 

from the group consisting of: APP(1~651)SW, K612V-TATexonI(MlL) APP (664- 
695) (SEQ ID NO:2); APP(1-651)SW, K612V, Gal4-VP16(M1L) APP (664-695) 
(SEQ ID NO:6); APP(1-651)SW, TATexonI(MlL) APP (664-695) (SEQ ID NO: 10); 
and APP(1-651)SW, Gal4-VP16(M1L) APP (664-695) (SEQ ID NO:14). 

15 

16. The method of claim 11 where the fusion protein is selected 
from the group consisting of: APP(1-651)NFEV, K612V-TATexonI(MlL) APP (664- 
695) (SEQ ID NO:23) and APP(l-65 1)NFEV, K612V, Gal4-VP16(M1L) APP (664- 
695)(SEQIDNO:25). 

20 

17. A method of identifying a substance that inhibits APP 
processing comprising: 

(a) providing a recombinant eukaryotic cell which: 

(i) expresses a fusion protein comprising an amino acid 
25 sequence from APP that is capable of being cleaved by both P-secretase and y- 

secretase and a transcription factor where the transcription factor is fused in frame to 
the amino acid sequence from APP; and 

(ii) comprises a reporter gene operably linked to a 
regulatory DNA sequence which capable of being activated by the transcription factor; 

30 (b) measuring the level of reporter gene product in the cell in the 

absence of the substance; 

(c) adding the compound to the cell and measuring the level of 
reporter gene product in the cell in the presence of the substance; 
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where a decrease in the level of reporter gene product in the presence 
as compared to the absence of the substance indicates that the substance inhibits APP 
processing. 

5 18. The method of claim 17 where the amino acid sequence from 

APP comprises an amino acid sequence selected from the group consisting of 589-651 
of APP695, 589-651 of the Swedish version of APP695, and 589-651 of the NFEV 
version of APP695. 

10 19. The method of claim 17 where the amino acid sequence from 

APP contains the amino acid sequence NLDA (SEQ ID NO:38) at the P-secretase 
cleavage site instead of the wild-type sequence KMDA (SEQ ID NO:34). 

20. The method of claim 17 where the amino acid sequence from 
15 APP contains the amino acid sequence NFEV (SEQ ID NO:40) at the P-secretase 
cleavage site instead of the wild-type sequence KMDA (SEQ ID NO:34). 
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DNA sequence of APPU-65USW. K612V-TATexonl(M L) APP (664-695) 
(SEQ ID NO: 1) 

1 ATGCTGCCCG GTTTGGCACT GCTCCTGCTG GCCGCCTGGA CGGCTCGGGC 
51 GCTGGAGGTA CCCACTGATG GTAATGCTGG CCTGCTGGCT GAACCCCAGA 
101 TTGCCATGTT CTGTGGCAGA CTGAACATGC ACATGAATGT CCAGAATGGG 
151 AAGTGGGATT CAGATCCATC AGGGACCAAA ACCTGCATTG ATACCAAGGA 
201 AGGCATCCTG CAGTATTGCC AAGAAGTCTA CCCTGAACTG CAGATCACCA 
251 ATGTGGTAGA AGCCAACCAA CCAGTGACCA TCCAGAACTG GTGCAAGCGG 
301 GGCCGCAAGC AGTGCAAGAC CCATCCCCAC TTTGTGATTC CCTACCGCTG 
351 CTTAGTTGGT GAGTTTATAA GTGATGCCCT TCTCGTTCCT GACAAGTGCA 
401 AATTCTTACA CCAGGAGAGG ATGGATGTTT GCGAAACTCA TCTTCACTGG 
451 CACACCGTCG CCAAAGAGAC ATGCAGTGAG AAGAGTACCA ACTTGCATGA 
501 CTACGGCATG TTGCTGCCCT GCGGAATTGA CAAGTTCCGA GGGGTAGAGT 
551 TTGTGTGTTG CCCACTGGCT GAAGAAAGTG ACAATGTGGA TTCTGCTGAT 
601 GCGGAGGAGG ATGACTCGGA TGTCTGGTGG GGCGGAGCAG ACACAGACTA 
651 TGCAGATGGG AGTGAAGACA AAGTAGTAGA AGTAGCAGAG GAGGAAGAAG 
701 TGGCTGAGGT GGAAGAAGAA GAAGCCGATG ATGACGAGGA CGATGAGGAT 
751 GGTGATGAGG TAGAGGAAGA GGCTGAGGAA CCCTACGAAG AAGCCACAGA 
801 GAGAACCACC AGCATTGCCA CCACCACCAC CACCACCACA GAGTCTGTGG 
851 AAGAGGTGGT TCGAGTTCCT ACAACAGCAG CCAGTACCCC TGATGCCGTT 
901 GACAAGTATC TCGAGACACC TGGGGATGAG AATGAACATG CCCATTTCCA 
951 GAAAGCCAAA GAGAGGCTTG AGGCCAAGCA CCGAGAGAGA ATGTCCCAGG 
1001 TCATGAGAGA ATGGGAAGAG GCAGAACGTC AAGCAAAGAA CTTGCCTAAA 
1051 GCTGATAAGA AGGCAGTTAT CCAGCATTTC CAGGAGAAAG TGGAATCTTT 
1101 GGAACAGGAA GCAGCCAACG AGAGACAGCA GCTGGTGGAG ACACACATGG 
1151 CCAGAGTGGA AGCCATGCTC AATGACCGCC GCCGCCTGGC CCTGGAGAAC 
1201 TACATCACCG CTCTGCAGGC TGTTCCTCCT CGGCCTCGTC ACGTGTTCAA 
1251 TATGCTAAAG AAGTATGTCC GCGCAGAACA GAAGGACAGA CAGCACACCC 
1301 TAAAGCATTT CGAGCATGTG CGCATGGTGG ATCCCAAGAA AGCCGCTCAG 
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1351 ATCCGGTCCC AGGTTATGAC ACACCTCCGT GTGATTTATG AGCGCATGAA 
1401 TCAGTCTCTC TCCCTGCTCT ACAACGTGCC TGCAGTGGCC GAGGAGATTC 
1451 AGGATGAAGT TGATGAGCTG CTTCAGAAAG AGCAAAACTA TTCAGATGAC 
1501 GTCTTGGCCA ACATGATTAG TGAACCAAGG ATCAGTTACG GAAACGATGC 
1551 TCTCATGCCA TCTTTGACCG AAACGAAAAC CACCGTGGAG CTCCTTCCCG 
1601 TGAATGGAGA GTTCAGCCTG GACGATCTCC AGCCGTGGCA TTCTTTTGGG 
1651 GCTGACTCTG TGCCAGCCAA CACAGAAAAC GAAGTTGAGC CTGTTGATGC 
1701 CCGCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGGGTTGA 
1751 CAAATATCAA GACGGAGGAG ATCTCTGAAG TGAATCTAGA TGCAGAATTC 
1801 CGACATGACT CAGGATATGA AGTTCATCAT CAAGTATTGG TGTTCTTTGC 
1851 AGAAGATGTG GGTTCAAACA AAGGTGCAAT CATTGGACTC ATGGTGGGCG 
1901 GTGTTGTCAT AGCGACAGTG ATCGTCATCA CCTTGGTGAT GCTGAAGAAG 
1951 AAAAAGCTTG GTACCGAGCT CGGATCCACT AGTCCAGTGT GGTGGAATTC 
2001 TGCAGATATC CTGGAGCCAG TAGATCCTAG ACTAGAGCCC TGGAAGCATC 
2051 CAGGAAGTCA GCCTAAAACT GCTTGTACCA ATTGCTATTG TAAAAAGTGT 
2101 TGCTTTCATT GCCAAGTTTG TTTCATGACA AAAGCCTTAG GCATCTCCTA 
2151 TGGCAGGAAG AAGCGGAGAC AGCGACGAAG AGCTCATCAG AACAGTCAGA 
2201 CTCATCAAGC TTCTCTATCA AAGCAGAGGA TATCCAGCAC AGTGGCGGCC 
2251 GCAGACGCCG CTGTCACCCC AGAGGAGCGC CACCTGTCCA AGATGCAGCA 
2301 GAACGGCTAC GAAAATCCAA CCTACAAGTT CTTTGAGCAG ATGCAGAACT 
2351 AG 
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(SEQ ID NO: 2) 

Amino acid sequence of APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 

ml pgl aim aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 

evypel qi tnvveanqpvti qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkkavi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyi tal qavpprprhvf nml kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl sllynvpavaeeiqdevdel Iqkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gads vpantenevepvdarpaadrgl ttrpgsgl tni kteei sev 
2 3 4 5 

nldaef rhdsgyevhhqyl vf f aedvgsnkga i i gl mvggvviatvi vi 1 1 vml kkk klgt e lgstspvwwns 

6 

•ad il epvdprl epwkhpgsqpktactncyckkccf hcqvcf mtkal gi syqrkkrrqrrrahqnsqthaasl ska 

7 8 
ri sstvQQQ daavtpeerhl skmqgngvenptvkf f eaman 

FIG. 3 
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DNA sequence of APP(l-651)wt. K612V-TATexon1(MlL) APP (664-695) 
(SEQ ID NO: 3) 



1 


ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGGTA 


CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGTGGCAGA 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGTTTATAA 


GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 


1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 


1351 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCCGT 


GTGATTTATG 


AGCGCATGAA 
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1401 TCAGTCTCTC TCCCTGCTCT ACAACGTGCC TGCAGTGGCC GAGGAGATTC 
1451 AGGATGAAGT TGATGAGCTG CTTCAGAAAG AGCAAAACTA TTCAGATGAC 
1501 GTCTTGGCCA ACATGATTAG TGAACCAAGG ATCAGTTACG GAAACGATGC 
1551 TCTCATGCCA TCTTTGACCG AAACGAAAAC CACCGTGGAG CTCCTTCCCG 
1501 TGAATGGAGA GTTCAGCCTG GACGATCTCC AGCCGTGGCA TTCTTTTGGG 
1651 GCTGACTCTG TGCCAGCCAA CACAGAAAAC GAAGTTGAGC CTGTTGATGC 
1701 CCGCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGGGTTGA 
1751 CAAATATCAA GACGGAGGAG ATCTCTGAAG TGAAGATGGA TGCAGAATTC 
1801 CGACATGACT CAGGATATGA AGTTCATCAT CAAGTATTGG TGTTCTTTGC 
1851 AGAAGATGTG GGTTCAAACA AAGGTGCAAT CATTGGACTC ATGGTGGGCG 
1901 GTGTTGTCAT AGCGACAGTG ATCGTCATCA CCTTGGTGAT GCTGAAGAAG 
1951 AAAAAGCTTG GTACCGAGCT CGGATCCACT AGTCCAGTGT GGTGGAATTC 
2001 TGCAGATATC CTGGAGCCAG TAGATCCTAG ACTAGAGCCC TGGAAGCATC 
2051 CAGGAAGTCA GCCTAAAACT GCTTGTACCA ATTGCTATTG TAAAAAGTGT 
2101 TGCTTTCATT GCCAAGTTTG TTTCATGACA AAAGCCTTAG GCATCTCCTA 
2151 TGGCAGGAAG AAGCGGAGAC AGCGACGAAG AGCTCATCAG AACAGTCAGA 
2201 CTCATCAAGC TTCTCTATCA AAGCAGAGGA TATCCAGCAC AGTGGCGGCC 
2251 GCAGACGCCG CTGTCACCCC AGAGGAGCGC CACCTGTCCA AGATGCAGCA 
2301 GAACGGCTAC GAAAATCCAA CCTACAAGTT CTTTGAGCAG ATGCAGAACT 
2351 AG 
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(SEQ ID NO: 4) 

Amino acid sequence of APP(l-651)wt, K612V-TATexonI(MlL) APP (664-695) 

ml pgl al 1 1 1 aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktcidtkegi 1 qycq 

evypel qi tnvveanqpvti qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 

whtvaketcsekstnl hdygnfl 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkka vi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyi tal qavpprprhvf nml kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rvi yermnqsl si 1 ynvpa vaeei qdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsfgadsvpantenevepvdarpaadrgl tt rpgsgl tni kteei sev 
2 3 4 5 

kmdaef rhdsgyevhhqyl vf f aedvgsnkgai i gl mvggyytatyi vi tl vml kkk kl gtcl gstspvwwns 

6 

a4 f 1 epvdprl epwkhpgsqpktactncyckkccf hcovcf mtkal gi sygrkkr rqrr rahansothoasl skq 

7 8 
ri sstvoaa daavtpeerhl skmqqngyenptykf feqmqn 

FIG. 5 
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DNA sequence of APP(1-651)SW, K612V-GAL4VP16(M1 L) APP (664-695) 
(SEQ ID NO: 5) 



1 


ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGGTA 


CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGTGGCAGA 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGTTTATAA 


GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


AC1TGCATGA 


501 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 


1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 
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1301 TAAAGCATTT CGAGCATGTG CGCATGGTGG ATCCCAAGAA AGCCGCTCAG 
1351 ATCCGGTCCC AGGTTATGAC ACACCTCCGT GTGATTTATG AGCGCATGAA 
1401 TCAGTCTCTC TCCCTGCTCT ACAACGTGCC TGCAGTGGCC GAGGAGATTC 
1451 AGGATGAAGT TGATGAGCTG CTTCAGAAAG AGCAAAACTA TTCAGATGAC 
1501 GTCTTGGCCA ACATGATTAG TGAACCAAGG ATCAGTTACG GAAACGATGC 
1551 TCTCATGCCA TCTTTGACCG AAACGAAAAC CACCGTGGAG CTCCTTCCCG 
1601 TGAATGGAGA GTTCAGCCTG GACGATCTCC AGCCGTGGCA TTCTTTTGGG 
1651 GCTGACTCTG TGCCAGCCAA CACAGAAAAC GAAGTTGAGC CTGTTGATGC 
1701 CCGCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGGGTTGA 
1751 CAAATATCAA GACGGAGGAG ATCTCTGAAG TGAATCTAGA TGCAGAATTC 
1801 CGACATGACT CAGGATATGA AGTTCATCAT CAAGTATTGG TGTTCTTTGC 
1851 AGAAGATGTG GGTTCAAACA AAGGTGCAAT CATTGGACTC ATGGTGGGCG 
1901 GTGTTGTCAT AGCGACAGTG ATCGTCATCA CCTTGGTGAT GCTGAAGAAG 
1951 AAAAAGCTTG GTACCGAGCT CGGATCCACT AGTCCAGTGT GGTGGAATTC 
2001 TGCAGATATC AAGCTACTGT CTTCTATCGA ACAAGCATGC GATATTTGCC 
2051 GACTTAAAAA GCTCAAGTGC TCCAAAGAAA AACCGAAGTG CGCCAAGTGT 
2101 CTGAAGAACA ACTGGGAGTG TCGCTACTCT CCCAAAACCA AAAGGTCTCC 
2151 GCTGACTAGG GCACATCTGA CAGAAGTGGA ATCAAGGCTA GAAAGACTGG 
2201 AACAGCTATT TCTACTGATT TTTCCTCGAG AAGACCTTGA CATGATTTTG 
2251 AAAATGGATT CTTTACAGGA TATAAAAGCA TTGTTAACAG GATTATTTGT 
2301 ACAAGATAAT GTGAATAAAG ATGCCGTCAC AGATAGATTG GCTTCAGTGG 
2351 AGACTGATAT GCCTCTAACA TTGAGACAGC ATAGAATAAG TGCGACATCA 
2401 TCATCGGAAG AGAGTAGTAA CAAAGGTCAA AGACAGTTGA CTGTATCGGG 
2451 AATTCCCGGG GATCTGGCCC CCCCGACCGA TGTCAGCCTG GGGGACGAGC 
2501 TCCACTTAGA CGGCGAGGAC GTGGCGATGG CGCATGCCGA CGCGCTAGAC 
2551 GATTTCGATC TGGACATGTT GGGGGACGGG GATTCCCCGG GTCCGGGATT 
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2601 TACCCCCCAC GACTCCGCCC CCTACGGCGC TCTGGATATG GCCGACTTCG 
2651 AGTTTGAGCA GATGTTTACC GATGCCCTTG GAATTGACGA GTACGGTGGG 
2701 GATATCCAGC ACAGTGGCGG CCGCGACGCC GCTGTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GATGCAGAAC TAG 

FIG.6C 
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(SEQ ID NO: 6) 

Amino acid sequence of APPU-65DSW, K612V r GAL4-VP16(delMl) APP (664-695) 

ml pgl al m aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgt ktci dt kegi 1 qycq 

evypel qi tn vveanqpvti qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcet hi h 

whtvaketcsekstnl hdygml 1 pcgidkf rgvefvccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkkavi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyi tal qavpprprhvf nml kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si 1 ynvpavaeeiqdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetktt vel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl tt rpgsgl tni kteei sev 
2 3 4 5 

nldaef rhdsgyevhhqyl vf f aedvgsnkgai igl mvggyviatyi vi tl vml kkk kl gtcl gstspvwwns 

ad ikl 1 ssi egacdi crl kkl kcskekpkcakcl knnwecrvspktkrspl trahl tevesrl erl eql f 1 1 i f predl d 

6 

mi 1 kmdsl qdi kal 1 tgl f vqdnvnkda vtdrl asvetdmpl tl rqhri satssseessnkgqrql tvsgi pgdl app 

tdvsl qdel hi dgedvamahadal ddf dl dml gdgdspgpgf tphdsapygal dmadf ef eomf tdal qi dev 

7 8 
qqdiqhsgaaa daavtpeerhlskmqqnqyenptykffeqmqn 

FIG. 7 
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DNA sequence of APP(l-651)wt, K612V, GAL4-VP16(deIMl) APP (664-695) 



(SEO 


ID NO- 7) 








I 

J. 


ATGCTGCCCG 


GTTTGGCACT GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 
•J J- 


GCTGGAGGTA 


rrr aptgatg gtaatgptgg 


rrTGTTGGfT 


GAACCCCAGA 

unnw v V/ vnun 


101 


TTGCCATGTT 

1 l VJUUn lull 


PTGTGGf AGA PTGAAfATGr 


AfATGAATGT 

nun t unn I \2 I 


CCAGAATGGG 


151 


AAGTGGGATT 


f AGATf f ATf AGGGACCAAA 

\jr\\Jr\ I bl»n 1 O nvjuunoonnn 


ACCTGCATTG 


ATACCAAGGA 

n i nv wnnvjvjn 


201 


AGGCATCCTG 


CAGTATTGCC AAGAAGTCTA 


CCCTGMCTG 


CAGATCACCA 

unvj/i i wnww#» 


251 


ATGTGGTAGA 

n i u i i nun 


AGCCAACCAA CCAGTGACCA 


TCCAGAACTG 

i wununnu i w 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC CCATCCCCAC 

nu i uunnwnvy uun i uooonw 


TTTGTGATTC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGTTTATAA GTGATGCCCT 

unu i i i n i nn \j i van i uuvu i 


TCtCGTTCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG ATGGATGTTT 

vunuununuu n i vj win i u i i i 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


TTGCTGCCCT GCGGAATTGA 

1 ft MV 1 V4WV 1 UwUU/ if 1 1 1 V-*/ 1 


CAAGTTCCGA 


GGGGTAGAGT 


551 


TTGTGTGTTG 


CCCACTGGCT GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCATTGCCA CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGAGGTGGT 


TCGAGTTCCT ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 
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1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 

1 i U V \\*A VI till 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 


1351 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCCGT 


GTGATTTATG 


AGCGCATGAA 


1401 


TCAGTCTCTC 


TCCCTGCTCT 


APAAPGTGPP 


TGPAGTGGPP 

1 VJOMVJ 1 uuUO 


GAGGAGATTP 


1451 


AGGATGAAGT 


TGATGAGCTG 


CTTCAGAAAG 


AGCAAAACTA 


TTPAGATGAP 


1501 


GTCTTGGCCA 

vj i u i i viva uur\ 


APATGATTAG 


TGAAPPAAGG 


ATPAGTTAPG 


GAAAPGATGP 


1551 


TCTCATGCCA 


TCTTTGACCG 


AAAPGAAAAP 

/A/Art ovjMMMMO 


PAPPGTGGAG 


PTPPTTPPPG 


1601 


TGAATGGAGA 


GTTCAGCCTG 


GAPGATPTPP 

vjmovjm i u i uur 


AGPPfTTGGPA 

MVJUUVj J VjUUM 


TTP 1 I 1 1 GGG 


1651 


GCTGACTCTG 


TGCCAGCCAA 


PAPAGAAAAP 


GAAGTTGAGP 


PTGTTGATGP 

u 1 Vj l l vjr\ I UU 


1701 


CCGCCCTGCT 


GCCGACCGAG 


GACTGACCAC 


TCGACCAGGT 


TCTGGGTTGA 


1751 


CAAATATCAA 


GACGGAGGAG 


ATCTCTGAAG 


TGAAGATGGA 


TGCAGAATTC 


1801 


CGACATGACT 


CAGGATATGA 


AGTTCATCAT 


CAAGTATTGG 


TGTTCTTTGC 


1851 


AGAAGATGTG 


GGTTCAAACA 


AAGGTGCAAT 


CATTGGACTC 


ATGGTGGGCG 


1901 


GTGTTGTCAT 


AGCGACAGTG 


ATCGTCATCA 


CCTTGGTGAT 


GCTGAAGAAG 


1951 


AAAAAGCTTG 


GTACCGAGCT 


CGGATCCACT 


AGTCCAGTGT 


GGTGGAATTC 


2001 


TGCAGATATC 


AAGCTACTGT 


CTTCTATCGA 


ACAAGCATGC 


GATATTTGCC 


2051 


GACTTAAAAA 


GCTCAAGTGC 


TCCAAAGAAA 


AACCGAAGTG 


CGCCAAGTGT 


2101 


CTGAAGAACA 


ACTGGGAGTG 


TCGCTACTCT 


CCCAAAACCA 


AAAGGTCTCC 

/\n#i\JVJ i \y l UO 


2151 


GCTGACTAGG 


GCACATCTGA 


CAGAAGTGGA 


ATCAAGGCTA 


GAAAGACTGG 


2201 


AACAGCTATT 


TCTACTGATT 


TTTCCTCGAG 


AAGACCTTGA 


CATGAI 1 1 IG 


2251 


AAA A Tr>r* A I T* 

AAAATGGATT 


CTTTACAGGA 


TATAAAAGCA 


TTGTTAACAG 


GATTATTTGT 


2301 


ACAAGATAAT 


GTGAATAAAG 


ATGCCGTCAC 


AGATAGATTG 


GCTTCAGTGG 


2351 


AGACTGATAT 


GCCTCTAACA 


TTGAGACAGC 


ATAGAATAAG 


TGCGACATCA 


2401 


TCATCGGAAG 


AGAGTAGTAA 


CAAAGGTCAA 


AGACAGTTGA 


CTGTATCGGG 


2451 


AATTCCCGGG 


GATCTGGCCC 


CCCCGACCGA 


TGTCAGCCTG 


GGGGACGAGC 


2501 


TCCACTTAGA 


CGGCGAGGAC 


GTGGCGATGG 


CGCATGCCGA 


CGCGCTAGAC 


2551 


GATTTCGATC 


TGGACATGTT 


GGGGGACGGG 


GATTCCCCGG 


GTCCGGGATT 
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2601 TACCCCCCAC GACTCCGCCC CCTACGGCGC TCTGGATATG GCCGACTTCG 
2651 AGTTTGAGCA GATGTTTACC GATGCCCTTG GAATTGACGA GTACGGTGGG 
2701 GATATCCAGC ACAGTGGCGG CCGCGACGCC GCTGTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GATGCAGAAC TAG 

FIG.8C 
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(SEQ ID NO: 8) 

Amino acid sequence of APP(l-651)wt, K612V, GAL4-VP16(delHl) APP (654-695) 
ml pgl allll aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 
evypel qi tnvveanqpvti qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 
whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 



edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsq vmreweeaerqaknl pkadkkavi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyi tal qavpprprhvf nml kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rvi yermnqsl si 1 ynvpa vaeei qdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl ttrpgsgl tni kteei sev 
2 3 4 5 

kmdaef rhdsgyevhhqkl vf f aedvgsnkgai i gl mvqg vvi atv i vi tl vml kk kkl gtel gstspvwwns 

a4 fkl 1 ssi eqacdi crl kkl kcskekpkcakcl knnwecryspktkrspl trahl tevesrl erl eol f 1 1 i f predl d 

6 

mi 1 kmdsl qdi kal 1 tgl f vadnvnkdavtdrl asvetdmpl tl rqhri satssseessnkgqrql t vsgi pgdl app 

tdvsl gdel hi dgedvamahadal ddfdl dm! gdgdspgpqftphdsapygal dmadf ef eqmf tdal gi dev 
7 8 



1 




FIG. 9 



WO 03/072041 



PCI7US03/05458 



16/76 

DNA sequence of APPU-65DSW, TATexonl (Ml L) APP (664-695) 



(SEQ 


ID NO: 9) 








1 


ATGCTGCCCG 


GTTTGGCACT GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGGTA 


CCCACTGATG GTAATGCTGG 


CGTGCTGGCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGTGGCAGA CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CAGTATTGCC AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGTGGTAGA 


AGCCAACCAA CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGTTTATAA GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


TTGCTGCCCT GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


TTGTGTGTTG 


CCCACTGGCT GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCATTGCCA CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGAGGTGGT 


TCGAGTTCCT ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 
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1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 


1351 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCCGT 


GTGATTTATG 


AGCGCATGAA 


1401 


TCAGTCTCTC 


TCCCTGCTCT 


ACAACGTGCC 


TGCAGTGGCC 


GAGGAGATTC 


1451 


AGGATGAAGT 


TGATGAGCTG 


CTTCAGAAAG 


AGCAAAACTA 


TTCAGATGAC 


1501 


GTCTTGGCCA 


ACATGATTAG 


TGAACCAAGG 


ATCAGTTACG 


GAAACGATGC 


1551 


TCTCATGCCA 


TCTTTGACCG 


AAACGAAAAC 


CACCGTGGAG 


CTCCTTCCCG 


1601 


TGAATGGAGA 


GTTCAGCCTG 


GACGATCTCC 


AGCCGTGGCA 


TTCI 1 1 IGGG 


1651 


GCTGACTCTG 


TGCCAGCCAA 


CACAGAAAAC 


GAAGTTGAGC 


CTGTTGATGC 


1701 


CCGCCCTGCT 


GCCGACCGAG 


GACTGACCAC 


TCGACCAGGT 


TCTGGGTTGA 


1751 


CAAATATCAA 


GACGGAGGAG 


ATCTCTGAAG 


TGAATCTAGA 


TGCAGAA1TC 


1801 


CGACATGACT 


CAGGATATGA 


AGTTCATCAT 


CAAAAATTGG 


TGTTCTTTGC 


1851 


AGAAGATGTG 


GGTTCAAACA 


AAGGTGCAAT 


CATTGGACTC 


ATGGTGGGCG 


1901 


GTGTTGTCAT 


AGCGACAGTG 


ATCGTCATCA 


CCTTGGTGAT 


GCTGAAGAAG 


1951 


AAAAAGCTTG 


GTACCGAGCT 


CGGATCCACT 


AGTCCAGTGT 


GGTGGAATTC 


2001 


TGCAGATATC 


CTGGAGCCAG 


TAGATCCTAG 


ACTAGAGCCC 


TGGAAGCATC 


one i 

2051 


LAuuAAu 1 LA 


fV^TA AAA f*T 

uLLIAAAALI 


GCTTGIACCA 


A TTPrT a "i — rr* 

AITGCTATTG 


TA A A A A PTOT 

TAAAAAGTGT 


2101 


TGCTTTCATT 


GCCAAGTTTG 


TTTCATGACA 


AAAGCCTTAG 


GCATCTCCTA 


2151 


TGGCAGGAAG 


AAGCGGAGAC 


AGCGACGAAG 


AGCTCATCAG 


AACAGTCAGA 


2201 


CTCATCAAGC 


TTCTCTATCA 


AAGCAGAGGA 


TATCCAGCAC 


AGTGGCGGCC 


2251 


GCAGACGCCG 


CTGTCACCCC 


AGAGGAGCGC 


CACCTGTCCA 


AGATGCAGCA 


2301 


GAACGGCTAC 


GAAAATCCAA 


CCTACAAGTT 


CTTTGAGCAG 


ATGCAGAACT 


2351 


AG 
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(SEQ ID NO: 10) 

» Amino acid sequence of APP(1-651)SW, TATexonKMIL) APP (664-695) 

ml pgl aim aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 

evypel qi tn vveanqpvti qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcet hi h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdn vdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyletpgdenehahfqkakerleakhrennsqvmreweeaerqaknl pkadkkaviqhfqekvesleqe 

aanerqql vethmarveaml ndrrrl alenyi talqavpprprhvf nml kkyvraeqkdrqhtl khfehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si 1 ynvpavaeei qdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsltetkttvellpvngefslddlqpwhsfgadsvpantenevepvdarpaadrglttrpgsgltnikteeisev 
2 3 4 5 

nl daef rhdsgvevhhqkl vf f aedvgsnkga i i gl mvgg vvi atv i vi tl vml kk kkl gtcl gstspvwwns 

6 

d4f1 epvdprl epwkhpgsgpktactncvckkccf hcqvcfmtkal gi sygrkkrrqrrrahqnsqthqasl skq 

7 8 
pi sstvaaa daavtpeerhl skmqqngyenptykf feqmqn 

FIG. 11 
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DNA sequence of APP(l-651)wt, TATexonl (Ml L) -APP (664-695) 



(SEQ 


ID NO: 11) 










1 


ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTG6AGGTA 


CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGTGGCAGA 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGTTTATAA 


GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 
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1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 


1351 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCCGT 


GTGATTTATG 


AGCGCATGAA 


1401 


TCAGTCTCTC 


TCCCTGCTCT 


ACAACGTGCC 


TGCAGTGGCC 


GAGGAGATTC 


1451 


AGGATGAAGT 


TGATGAGCTG 


CTTCAGAAAG 


AGCAAAACTA 


TTCAGATGAC 


1501 


GTCTTGGCCA 


ACATGATTAG 


TGAACCAAGG 


ATCAGTTACG 


GAAACGATGC 


1551 


TCTCATGCCA 


TCTTTGAGCG 


AAACGAAAAC 


CACCGTGGAG 


CTCCTTCCCG 


1601 


TGAATGGAGA 


GTTCAGCCTG 


GACGATCTCC 


AGCCGTGGCA 


TTCIIIIGGG 


1651 


GCTGACTCTG 


TGCCAGCCAA 


CACAGAAAAC 


GAAGTTGAGC 


CTGTTGATGC 


1701 


CCGCCCTGCT 


GCCGACCGAG 


GACTGACCAC 


TCGACCAGGT 


TCTGGGTTGA 


1751 


CAAATATCAA 


GACGGAGGAG 


ATCTCTGAAG 


TGAAGATGGA 


TGCAGAATTC 


1801 


CGACATGACT 


CAGGATATGA 


AGTTCATCAT 


CAAAAATTGG 


TGTTCTTTGC 


1851 


AGAAGATGTG 


GGTTCAAACA 


AAGGTGCAAT 


CATTGGACTC 


ATGGTGGGCG 


1901 


GTGTTGTCAT 


AGCGACAGTG 


ATCGTCATCA 


CCTTGGTGAT 


GCTGAAGAAG 


1951 


AAAAAGCTTG 


GTACCGAGCT 


CGGATCCACT 


AGTCCAGTGT 


GGTGGAATTC 


2001 


TGCAGATATC 


CTGGAGCCAG 


TAGATCCTAG 


ACTAGAGCCC 


TGGAAGCATC 


2051 


CAGGAAGTCA 


GCCTAAAACT 


GCTTGTACCA 


ATTGCTATTG 


TAAAAAGTGT 


2101 


TGCTTTCATT 


GCCAAGTTTG 


TTTCATGACA 


AAAGCCTTAG 


GCATCTCCTA 


2151 


TGGCAGGAAG 


AAGCGGAGAC 


AGCGACGAAG 


AGCTCATCAG 


AACAGTCAGA 


2201 


CTCATCAAGC 


TTCTCTATCA 


AAGCAGAGGA 


TATCCAGCAC 


AGTGGCGGCC 


2251 


GCAGACGCCG 


CTGTCACCCC 


AGAGGAGCGC 


CACCTGTCCA 


AGATGCAGCA 


2301 


GAACGGCTAC 


GAAAATCCAA 


CCTACAAGTT 


CTTTGAGCAG 


ATGCAGAACT 


2351 


AG 
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(SEQ ID NO: 12) 

Amino acid sequence of APP(l-651)wt, TATexonl(MlL) APP (664-695) 

ml pgl aim aawta ral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dt kegi 1 qycq 

evypel qi tnvveanqpvt i qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkkavi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrri alenyi tal qavpprprhvfnml kkyvraeqkdrqhtl khfehvrmvd 

pkkaaqi rsqvmthl rvi yermnqsl si lynvpavaeei qdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsfgadsvpantenevepvdarpaadrgl ttrpgsgl tni kteei sev 
2 3 4 5 

kmdaef rhdsgyevhhqkl vf f aedvgsnkgai i gl mvgg vvi atv i vi tl vml kkk kl gtcl gstspvwwns 

6 

ad 41 epvdprl epwkhpqsqpktactncvckkccf hcqvcfmtkalgi svqrkkrrqrrrahqnsqthqasl skq 

7 8 
ri sstvaao daavtpeerhl skmoangyenptykf f eqmqn 
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DNA sequence of APP(1-651)SW, GAL4VP16(delMet) APP (664-695) 
(SEQ ID NO: 13) 



1 ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 GCTGGAGGTA 


CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


101 TTGCCATGTT 


CTGTGGCAGA 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


351 CTTAGTTGGT 


GAGTTTATAA 


GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 AATTCTTACA 


CCAGGAGAGG 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 
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1251 


TATGCTAAAG 


MGTATGTCC 


GCGCAGAACA 


GAAGGAPAGA PAGPACAPCC 


1301 


TAAAGPATTT 


CGAGCATGTG 


CGCATGGTGG 


ATPPPAAGAA AGPPGPTPAG 


1351 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCCGT 


GTGATTTATG AGPGPATGAA 


1401 


TPAGTCTPTP 


TCCCTGCTCT 


ACAACGTGCC 


TGPAGTGGPP GAGGAGATTP 


1451 


AGGATGAAGT 


TGATGAGCTG 


CTTCAGAAAG 


AGPAAAAPTA TTPAGATGAP 


1501 

1JU1 


GTPTTGGPPA 


ACATGATTAG 


TGAACCAAGG 


ATPAGTTAPG ftAAAPGATGP 


1551 

X»J<J X 


TPTPATGPPA 


TCTTTGACCG 


AAACGAAAAC 


PAPPGTGGAG PTPPTTPPPG 


1601 


TGAATGGAGA 


GTTCAGCCTG 


GACGATCTCC 


AGPPGTGGPA TTPI 1 1 IGGG 


1651 


GPTGAPTPTG 


TGCCAGCCAA 


CACAGAAAAC 


GAAGTTGAGP PTGTTGATCP 


1701 


ppgppptgpt 


GCCGACCGAG 


GACTGACCAC 


TPGAPPAGGT TPTGGGTTGA 


1751 

X / wJX 


CAAATATCAA 


GACGGAGGAG 


ATCTCTGAAG 


TGAATPTAGA TGPAGAATTP 


1801 


CGACATGACT 


CAGGATATGA 


AGTTCATCAT 


CAAAAA7TGG TGTTCTTTGC 


1851 


AGAAGATGTG 


GGTTCAAACA 


AAGGTGCAAT 


PATTGGAPTP ATGGTGGGPG 


1901 


GTGTTGTCAT 


AGCGACAGTG 


ATCGTCATCA 


CCTTGGTGAT GCTGAAGAAG 


1951 


AAAAAGCTTG 


GTACCGAGCT 


CGGATCCACT 


AGTCCAGTGT GGTGGAATTC 


2001 


TGCAGATATC 


AAGCTACTGT 


CTTCTATCGA 


ACAAGCATGC GATATTTGCC 


2051 


GACTTAAAAA 


GCTCAAGTGC 


TCCAAAGAAA 


AACCGAAGTG CGCCAAGTGT 


2101 


CTGAAGAACA 


ACTGGGAGTG 


TCGCTACTCT 


CCCAAAACCA AAAGGTCTCC 


2151 


GCTGACTAGG 


GCACATCTGA 


CAGAAGTGGA 


ATCAAGGCTA GAAAGACTGG 


2201 


AACAGCTATT 


TCTACTGATT 


TTTCCTCGAG 


AAGACCTTGA GATGA 1 1 1 1 G 


2251 


AAAATGGATT 


will M^nuun 


TATAAAAGPA 

1 r\ 1 nnnnUUn 


TTGTTAACAG GATTATTTGT 


2301 


ACAAGATAAT 


GTGAATAAAG 


ATGCCGTCAC 


AGATAGATTG GCTTCAGTGG 


2351 


AGACTGATAT 


GCCTCTAACA 


TTGAGACAGC 


ATAGAATAAG TGCGACATCA 


2401 


TCATCGGAAG 


AGAGTAGTAA 


CAAAGGTCAA 


AGACAGTTGA CTGTATCGGG 


2451 


AATTCCCGGG 


GATCTGGCCC 


CCCCGACCGA 


TGTCAGCCTG GGGGACGAGC 


2501 


TCCACTTAGA 


CGGCGAGGAC 


GTGGCGATGG 


CGCATGCCGA CGCGCTAGAC 


2551 


GATTTCGATC 


TGGACATGTT 


GGGGGACGGG 


GATTCCCCGG GTCCGGGATT 
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2601 TACCCCCCAC GACTCCGCCC CCTACGGCGC TCTGGATATG GCCGACTTCG 
2651 AGTTTGAGCA GATGTTTACC GATGGCCTTG GAATTGACGA GTACGGTGGG 
2701 GATATCCAGC ACAGTGGCGG CCGCGACGCC GCTGTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GATGCAGAAC TAG 

FIG.14C 
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(SEQ ID NO: 14) 

Amino acid sequence of APPU-65DSW. K612V, GAL4-VP16(delMl) APP (664-695) 

ml pgl aim aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 

evypel qi tnvveanqpvti qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkkavi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyi tal qavpprprhvf nml kky vraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si lynvpavaeeiqdevdellqkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl tt rpgsgl tni kteei sev 
2 3 4 5 

nldaef rhdsgyevhhqkl vf f aedvgsnkgai i glmvgq vvi atv i vi tl vml kkk kl gtcl gstspvwwns 

^4 -kl 1 ssi eqacdi crl kkl kcskekpkcakcl knnwecrvspktkrspl t rahl tevesrl erl egl f 1 1 i f predl d 

6 

mi 1 kmdsl qdi kal 1 tgl f vadnvnkdavtdrl asvetdmpl tl rqhri sat ssseessnkgqrql tvsgi pgdl app 

tdvsl gdel hi dgedvamahadal ddf dl dml gdgdspgpgf tphdsapygal dmadf ef eqmf tdal gi dev 
7 8 
^daavtpeerhl skmqqngyenptvkf feqmqn 



FIG. 15 



WO 03/072041 



PCT/DS03/05458 



26/76 



DNA sequence of APP(l-651)wt, GAL4VP16(delMet) APP (664-695) 
(SEQ ID NO: 15) 



1 ATGCTGCCCG 

X n 1 uu t VJVsVsV/VJ 


GTTTGGCACT 

vj i i i uuunu i 


GCTCCTGCTG 

VJV«» I ww 1 uw I VJ 


GCCGCCTGGA 

ULrwUwo i vjvjn 


CGGCTCGGGC 

VsVJVJVs 1 VsVJUVJVs 


51 GPTGGAGGTA 

J x uu 1 uvjnuu 1 r\ 


PPPAPTGATG 

v/v»vfr\V/ 1 Vjn 1 vj 


GTAATGPTGG 


PPTGPTGGPT 

w v» 1 UU 1 UUw 1 


GAAPPPPAGA 

vjnnOv^ownvjn 


101 TTGPPATGTT 

1U1 1 1 UOL-n 1 u 1 1 


PTGTGGPAGA 

\s 1 vj 1 uvjvnvjn 


PTGAACATGP 


APATGAATGT 

nun I vjnn I vj i 


PPAGAATGGG 

ovVAvjnn 1 vjvjvj 


151 AAGTGGGATT 

1J1 nnvj I Vjuun I I 


CAGATCCATC 

VWiVJn 1 uw/l 1 \j 


AGGGACCAAA 

nvj uunv* v/nnn 


ACCTGCATTG 

nov* i vjun ii i vj 


ATACCAAGGA 

n i nwwnnuun 


9M AGGPATPPTG 

CJ\J X nvjVJL»rA 1 Ov 1 vj 


PAGTATTGCC 

v^nvj i n i i uv/b 


AAGAAGTCTA 

nnunnu io in 


CCCTGAACTG 


CAGATCACCA 


951 ATGTGGTAGA 

lJI n 1 vj 1 Vjvj 1 nun 


AGPPAAPPAA 


PPAGTGACCA 


TPCAGAACTG 

i Vjununnu i vj 


GTGCAAGCGG 

vj i vjv>nnvjV/VJVJ 


lf)1 GGffGPAAGf 

OUl vjUVvvVjVnnVJVy 


AGTGCAAGAC 

mvj I vjv-rnnvjnvv 


CCATCCCCAC 


TTTGTGATTC 

1 1 1 VJ 1 VJn 1 1 V-r 


CCTACCGCTG 


151 CTTAGTTGGT 

J J JL Oil nVJ 1 1 VJVJ | 


GAGTTTATAA 


GTGATGCCCT 

vj i un i wvww i 


TCTCGTTCCT 

1 w 1 UU 1 1 w w 1 


GACAAGTGCA 


401 AATTCTTACA 

^ V X »vn I I v I I nun 


CCAGGAGAGG 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 
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1251 TATGCTAAAG AAGTATGTCC GCGCAGAACA GAAGGACAGA CAGCACACCC 
1301 TAAAGCATTT CGAGCATGTG CGCATGGTGG ATCCCAAGAA AGCCGCTCAG 
1351 ATCCGGTCCC AGGTTATGAC ACACCTCCGT GTGATTTATG AGCGCATGAA 
1401 TCAGTCTCTC TCCCTGCTCT ACAACGTGCC TGCAGTGGCC GAGGAGATTC 
1451 AGGATGAAGT TGATGAGCTG CTTCAGAAAG AGCAAAACTA TTCAGATGAC 
1501 GTCTTGGCCA ACATGATTAG TGAACCAAGG ATCAGTTACG GAAACGATGC 
1551 TCTCATGCCA TCTTTGACCG AAACGAAAAC CACCGTGGAG CTCCTTCCCG 
1601 TGAATGGAGA GTTCAGCCTG GACGATCTCC AGCCGTGGCA TTCTTTTGGG 
1651 GCTGACTCTG TGCCAGCCAA CACAGAAAAC GAAGTTGAGC CTGTTGATGC 
1701 CCGCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGGGTTGA 
1751 CAAATATCAA GACGGAGGAG ATCTCTGAAG TGAAGATGGA TGCAGAATTC 
1801 CGACATGACT CAGGATATGA AGTTCATCAT CAAAAATTGG TGTTCTTTGC 
1851 AGAAGATGTG GGTTCAAACA AAGGTGCAAT CATTGGACTC ATGGTGGGCG 
1901 GTGTTGTCAT AGCGACAGTG ATCGTCATCA CCTTGGTGAT GCTGAAGAAG 
1951 AAAAAGCTTG GTACCGAGCT CGGATCCACT AGTCCAGTGT GGTGGAATTC 
2001 TGCAGATATC AAGCTACTGT CTTCTATCGA ACAAGCATGC GATATTTGCC 
2051 GACTTAAAAA GCTCAAGTGC TCCAAAGAAA AACCGAAGTG CGCCAAGTGT 
2101 CTGAAGAACA ACTGGGAGTG TCGCTACTCT CCCAAAACCA AAAGGTCTCC 
2151 GCTGACTAGG GCACATCTGA CAGAAGTGGA ATCAAGGCTA GAAAGACTGG 
2201 AACAGCTATT TCTACTGATT TTTCCTCGAG AAGACCTTGA CATGATTTTG 
2251 AAAATGGATT CTTTACAGGA TATAAAAGCA TTGTTAACAG GATTATTTGT 
2301 ACAAGATAAT GTGAATAAAG ATGCCGTCAC AGATAGATTG GCTTCAGTGG 
2351 AGACTGATAT GCCTCTAACA TTGAGACAGC ATAGAATAAG TGCGACATCA 
2401 TCATCGGAAG AGAGTAGTAA CAAAGGTCAA AGACAGTTGA CTGTATCGGG 
2451 AATTCCCGGG GATCTGGCCC CCCCGACCGA TGTCAGCCTG GGGGACGAGC 
2501 TCCACTTAGA CGGCGAGGAC GTGGCGATGG CGCATGCCGA CGCGCTAGAC 
2551 GATTTCGATC TGGACATGTT GGGGGACGGG GATTCCCCGG GTCCGGGATT 
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2601 TACCCCCCAC GACTCCGCCC CCTACGGCGC TCTGGATATG GCCGACTTCG 
2651 AGTTTGAGCA GATGTTTACC GATGCCCTTG GAATTGACGA GTACGGTGGG 
2701 GATATCCAGC ACAGTGGCGG CCGCGACGCC GCTGTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GATGCAGAAC TAG 

FIG.16C 
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(SEQ ID NO: 16) 

Amino acid sequence of APP(l-651)wt, K612V, GAL4-VP16(de!Ml) APP (664-695) 
ml pgl aim aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 
evypel qi tnvveanqpvt i qnwckrgrkqckt hphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 
whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 



edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkka vi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyi tal qavpprprhvf nml kkyvraeqkdrqhtl khfehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si 1 ynvpavaeei qdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl ttrpgsgl tni kteei sev 
2 3 4 5 

kmdaef rhdsgyevhhqkl vf f aedvgsnkgai i gl mvqq vviatv i vi tl vml kkk kl gtcl gstspvwwns 

ad tkl 1 ssi eoacdi crl kkl kcskekpkcakcl knnwecryspktkrspl trahl tevesrl erl eg! f 1 1 i f predl d 

6 

mi 1 kmdsl qdi kal 1 tgl f vqdnvnkdavtdrl asvetdmpl tl rqhri satssseessnkgorql tvsgi pgdl app 
tdvsl gdel hi dgedvamahadal ddfdl dml gdgdspgpgf tphdsapygal dmadf ef eqmf tdal gi dev 
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(SEQ ID NO: 17) 



6 

12 
18 
24 
30 
36 
42 
48! 
54 
60 
66 
72 
78 
84 
90 
96 
102 
108 
114 
120 
126 
132 
138 
144 
150 
156 
162 
168 
174 
180 
186 
192 
198 
204 
210 
216 
222 
228 
234 
240 
246 
252 
258 
264: 
270 
276 



agtttcctcg 
acggcggcgg 
gcactcggtg 
gcctggacgg 
ccccagattg 
tgggattcag 
tattgccaag 
gtgaccatcc 
gtgattccct 
aagtgcaaat 
accgtcgcca 
ctgccctgcg 
gaaagtgaca 
ggagcagaca 
gaagaagtgg 
gatgaggtag 
attgccacca 
acagcagcca 
gaacatgccc 
tcccaggtca 
gataagaagg 
gccaacgaga 
gaccgccgcc 
cctcgtcacg 
cacaccctaa 
cggtcccagg 
ctgctctaca 
cagaaagagc 
agttacggaa 
cttcccgtga 
gactctgtgc 
gaccgaggac 
tctgaagtga 
aaattggtgt 
gtgggcggtg 
cagtacacat 
cgccacctgt 
cagatgcaga 
ttcactaccc 
atttactcat 
attaatccac 
ttattaatgg 
attctctcct 
tgtgacccaa 
tgaacgtggg 
tttaaagtta 
gcattttact 



gcagcggtag 
tggcggcgcg 
ccccgcgcag 
ctcgggcgct 
ccatgttctg 
atccatcagg 
aagtctaccc 
agaactggtg 
accgctgctt 
tcttacacca 
aagagacatg 
gaattgacaa 
atgtggattc 
cagactatgc 
ctgaggtgga 
aggaagaggc 
ccaccaccac 
gtacccctga 
atttccagaa 
tgagagaatg 
cagttatcca 
gacagcagct 
gcctggccct 
tgttcaatat 
agcatttcga 
ttatgacaca 
acgtgcctgc 
aaaactattc 
acgatgctct 
atggagagtt 
cagccaacac 
tgaccactcg 
agatggatgc 
tctttgcaga 
ttgtcatagc 
ccattcatca 
ccaagatgca 
actagacccc 
atcggtgtcc 
tatcgccttt 
acatcagtaa 
gttttgtgta 
gattatttat 
ttaagtccta 
agttcagctg 
aacattttta 
gtacagattg 



gcgagagcac 
ggcagagcaa 
ggtcgcgatg 
ggaggtaccc 
tggcagactg 
gaccaaaacc 
tgaactgcag 
caagcggggc 
agttggtgag 
ggagaggatg 
cagtgagaag 
gttccgaggg 
tgctgatgcg 
agatgggagt 
agaagaagaa 
tgaggaaccc 
caccacagag 
tgccgttgac 
agccaaagag 
ggaagaggca 
gcatttccag 
ggtggagaca 
ggagaactac 
gctaaagaag 
gcatgtgcgc 
cctccgtgtg 
agtggccgag 
agatgacgtc 
catgccatct 
cagcctggac 
agaaaacgaa 
accaggttct 
agaattccga 
agatgtgggt 
gacagtgatc 
tggtgtggtg 
gcagaacggc 
cgccacagca 
atttatagaa 
tgacagctgt 
tgtattctat 
ctgtaaagaa 
cacatagccc 
ctttacatat 
cttctcttgc 
agtatttcag 
ctgcttctgc 



gcggaggagc 
ggacgcggcg 
ctgcccggtt 
actgatggta 
aacatgcaca 
tgcattgata 
atcaccaatg 
cgcaagcagt 
tttgtaagtg 
gatgtttgcg 
agtaccaact 
gtagagtttg 

gaggaggatg 

gaagacaaag 
gccgatgatg 
tacgaagaag 
tctgtggaag 
aagtatctcg 
aggcttgagg 
gaacgtcaag 
gagaaagtgg 
cacatggcca 
atcaccgctc 
tatgtccgcg 
atggtggatc 
atttatgagc 
gagattcagg 
ttggccaaca 
ttgaccgaaa 
gatctccagc 
gttgagcctg 
gggttgacaa 
catgactcag 
tcaaacaaag 
gtcatcacct 
gaggttgacg 
tacgaaaatc 
gcctctgaag 
taatgtggga 
gctgtaacac 
ctctctttac 
tttagctgta 
cttagccagt 
gctttaagaa 
ctaagtattc 
atgctttaga 
tatatttgtg 



gtgcgcgggg 
gatcccactc 
tggcactgct 
atgctggcct 
tgaatgtcca 
ccaaggaagg 
tggtagaagc 
gcaagaccca 
atgcccttct 
aaactcatct 
tgcatgacta 
tgtgttgccc 
actcggatgt 
tagtagaagt 
acgaggacga 
ccacagagag 
aggtggttcg 
agacacctgg 
ccaagcaccg 
caaagaactt 
aatctttgga 
gagtggaagc 
tgcaggctgt 
cagaacagaa 
ccaagaaagc 
gcatgaatca 
atgaagttga 
tgattagtga 
cgaaaaccac 
cgtggcattc 
ttgatgcccg 
atatcaagac 
gatatgaagt 
gtgcaatcat 
tggtgatgct 
ccgctgtcac 
caacctacaa 
ttggacagca 
agaaacaaac 
aagtagatgc 
attttggtct 
tcaaactagt 
tgtatattat 
tcgatggggg 
ctttcctgat 
gagatttttt 
atataggaat 



gccccgggag 
gcacagcagc 
cctgctggcc 
gctggctgaa 
gaatgggaag 
catcctgcag 
caaccaacca 
tccccacttt 
cgttcctgac 
tcactggcac 
cggcatgttg 
actggctgaa 
ctggtggggc 
agcagaggag 
tgaggatggt 
aaccaccagc 
agttcctaca 
ggatgagaat 
agagagaatg 
gcciaaagct 
acaggaagca 
catgctcaat 
tcctcctcgg 
ggacagacag 
cgctcagatc 
gtctctctcc 
tgagctgctt 
accaaggatc 
cgtggagctc 
ttttggggct 
ccctgctgcc 
gga'ggagatc 
tcatcatcaa 
tggactcatg 
gaagaagaaa 
cccagaggag 
gttctttgag 
aaaccattgc 
ccgttttatg 
ctgaacttga 
ctatactaca 
gcatgaatag 
tcttgtggtt 
atgcttcatg 
cactatgcat 
ttccatgact 
taagaggata 
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2821 cacacgtttg tttcttcgtg cctgttttat 
2881 ttttcttttt ttgtccacgt atctttgggt 
2941 gtaagcactt ttacggggcg ggtggggagg 
3001 ttctccaaaa caattttctg caggatgatt 
3061 tttctacact gtattacata aataaattaa 
3121 gaaggatgac tacagacatt aaataatcga 
3181 tcaattttct ttaaccagtc tgaagtttca 
3241 gtggcaatat aaggggatga ggaaggcatg 
3301 ttcaatttgt ataaaatggt gttttcatgt 



gtgcacacat taggcattga gacttcaagc 
ctttgataaa gaaaagaatc cctgttcatt 
ggtgctctgc tggtcttcaa ttaccaagaa 
gtacagaatc attgcttatg acatgatcgc 
ataaaataac cccgggcaag acttttcttt 
agtaattttg ggtggggaga agaggcagat 
tttatgatac aaaagaagat gaaaatggaa 
cctggacaaa cccttctttt aagatgtgtc 
aaataaatac attcttggag gage 
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(SEQ ID NO: 18) 

MLPGLALLLLAAWTARALEVPTDGNAGLLAEPQIAMFCGRLNMH 

MNVQNGKWDSDPSGTKTCIDTKEGILQYCQEVYPELQITNVVEANQPVTIQNWCKRGR 

KQCIOHPHFVIPYRCLVGEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSE 

KSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYA 

DGSEDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTT 

T1TTESVEEVVRVPTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQV 

MREWEEAERQAKNLPKADKKAVIQHFQEKVESLEQEAANERQQLVETHMARVEAMLND 

RRRLALENYITALQAVPPRPRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQ 

IRSQVMTHLRVIYERMMQSLSLLYNVPAVAEEIQDEVDELLQKEQNYSDDVLANMISE 

PRISYGNDALMPSLTETXTTVELLPVNGEFSLDDLQPWHSFGADSVPANTENEVEPVD 

ARPAADRGLTTRPGSGLTNIKTEEISEVKMDAEFRHDSGYEVHHQKLVFFAEDVGSNK 

GAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGY 

ENPTYKFFEQMQN 
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(SEQ ID NO: 19 AND 20) 

1 ATGGTGAGCA AGGGCGAGGA GCTGTTCACC GGGGTGGTGC CCATCCTGGT 

TACCACTCGT TCCCGCTCCT CGACAAGTGG CCCCACCACG GGTAGGACCA 
51 CGAGCTGGAC GGCGACGTAA ACGGCCACAA GTTCAGCGTG TCCGGCGAGG 

GCTCGACCTG CCGCTGCATT TGCCGGTGTT CAAGTCGCAC AGGCCGCTCC 
101 GCGAGGGCGA TGCCACCTAC GGCAAGCTGA CCCTGAAGTT CATCTGCACC 

CGCTCCCGCT ACGGTGGATG CCGTTCGACT GGGACTTCAA GTAGACGTGG 
151 ACCGGCAAGC TGCCCGTGCC CTGGCCCACC CTCGTGACCA CCTTCACCTA 

TGGCCGTTCG ACGGGCACGG GACCGGGTGG GAGCACTGGT GGAAGTGGAT 
201 CGGCGTGCAG TGCTTCGCCC GCTACCCCGA CCACATGAAG CAGCACGACT 

GCCGCACGTC ACGAAGCGGG CGATGGGGCT GGTGTACTTC GTCGTGCTGA 
251 TCTTCAAGTC CGCCATGCCC GAAGGCTACG TCCAGGAGCG CACCATCTTC 

AGAAGTTCAG GCGGTACGGG CTTCCGATGC AGGTCCTCGC GTGGTAGAAG 
301 TTCAAGGACG ACGGCAACTA CAAGACCCGC GCCGAGGTGA AGTTCGAGGG 

AAGTTCCTGC TGCCGTTGAT GTTCTGGGCG CGGCTCCACT TCAAGCTCCC 
351 CGACACCCTG GTGAACCGCA TCGAGCTGAA GGGCATCGAC TTCAAGGAGG 

GCTGTGGGAC CACTTGGCGT AGCTCGACTT CCCGTAGCTG AAGTTCCTCC 
401 ACGGCAACAT CCTGGGGCAC AAGCTGGAGT ACAACTACAA CAGCCACAAG 

TGCCGTTGTA GGACCCCGTG TTCGACCTCA TGTTGATGTT GTCGGTGTTC 
451 GTCTATATCA CCGCCGACAA GCAGAAGAAC GGCATCAAGG TGAACTTCAA 

CAGATATAGT GGCGGCTGTT CGTCTTCTTG CCGTAGTTCC ACTTGAAGTT 
501 GACCCGCCAC AACATCGAGG ACGGCAGCGT GCAGCTCGCC GACCACTACC 

CTGGGCGGTG TTGTAGCTCC TGCCGTCGCA CGTCGAGCGG CTGGTGATGG 
551 AGCAGAACAC CCCCATCGGC GACGGCCCCG TGCTGCTGCC CGACAACCAC 

TCGTCTTGTG GGGGTAGCCG CTGCCGGGGC ACGACGACGG GCTGTTGGTG 
601 TACCTGAGCA CCCAGTCCGC CCTGAGCAAA GACCCCAACG AGAAGCGCGA 

ATGGACTCGT GGGTCAGGCG GGACTCGTTT CTGGGGTTGC TCTTCGCGCT 
651 TCACATGGTC CTGCTGGAGT TCGTGACCGC CGCCGGGATC ACTCTCGGCA 

AGTGTACCAG GACGACCTCA AGCACTGGCG GCGGCCCTAG TGAGAGCCGT 
701 TGGACGAGCT GTACAAGTAA CTCGAGTCTA GAGGGCCCGT TTAAACCCGC 

ACCTGCTCGA CATGTTCATT GAGCTCAGAT CTCCCGGGCA AATTTGGGCG 
751 TGATCAGCCT CGACTGTGCC TTCTAGTTGC CAGCCATCTG TTGTTTGCCC 

ACTAGTCGGA GCTGACACGG AAGATCAACG GTCGGTAGAC AACAAACGGG 
801 CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG TGCCACTCCC ACTGTCCTTT 

GAGGGGGCAC GGAAGGAACT GGGACCTTCC ACGGTGAGGG TGACAGGAAA 
851 CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG GTGTCATTCT 

GGATTATTTT ACTCCTTTAA CGTAGCGTAA CAGACTCATC CACAGTAAGA 
901 ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 

TAAGACCCCC CACCCCACCC CGTCCTGTCG TTCCCCCTCC TAACCCTTCT 
951 CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGCT TCTGAGGCGG 

GTTATCGTCC GTACGACCCC TACGCCACCC GAGATACCGA AGACTCCGCC 
1001 AAAGAACCAG CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA 

TTTCTTGGTC GTACACTCGT TTTCCGGTCG TTTTCCGGTC CTTGGCATTT 
1051 AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA 

TTCCGGCGCA ACGACCGCAA AMGGTATCC GAGGCGGGGG GACTGCTCGT 
1101 TCAC AAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT 

AGTGTTTTTA GCTGCGAGTT CAGTCTCCAC CGCTTTGGGC TGTCCTGATA 
1151 AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT 

TTTCTATGGT CCGCAAAGGG GGACCTTCGA GGGAGCACGC GAGAGGACAA 
1201 CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG 

GGCTGGGACG GCGAATGGCC TATGGACAGG CGGAAAGAGG GAAGCCCTTC 
1251 CGTGGCGCTT TCTCATAGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 

GCACCGCGAA AGAGTATCGA GTGCGACATC CATAGAGTCA AGCCACATCC 
1301 TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC 

AGCAAGCGAG GTTCGACCCG ACACACGTGC TTGGGGGGCA AGTCGGGCTG 
1351 CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA 

CCGACGCGGA ATAGGCCATT GATAGCAGAA CTCAGGTTGG GCCATTCTGT 
1401 CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA 

GCTGAATAGC GGTGACCGTC GTCGGTGACC ATTGTCCTAA TCGTCTCGCT 
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1451 GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC 

CCATACATCC GCCACGATGT CTCAAGAACT TCACCACCGG ATTGATGCCG 
1501 TACACTAGAA GAACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC 

ATGTGATCTT CTTGTCATAA ACCATAGACG CGAGACGACT TCGGTCAATG 
1551 CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG 

GAAGCCTTTT TCTCAAC CAT CGAGAACTAG GCCGTTTGTT TGGTGGCGAC 
1601 GTAGCGGTGG 1 1 1 1 1 1 IGTT TGCAAGCAGC AGATTACGCG CAG AAAAAAA 

CATCGCCACC AAAAAAACAA ACGTTCGTCG TCTAATGCGC GTC IIII I II 
1651 GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG 

CCTAGAGTTC TTCTAGGAAA CTAGAAAAGA TGCCCCAGAC TGCGAGTCAC 
1701 GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGACATTA ACCTATAAAA 

CTTGCTTTTG AGTGCAATTC CCTAAAACCA GTACTGTAAT TGGATATTTT 
1751 ATAGGCGTAT CACGAGGCCC TTTCGTCTCG CGCGTTTCGG TGATGACGGT 

TATCCGCATA GTGCTCCGGG AAAGCAGAGC GCGCAAAGCC ACTACTGCCA 
1801 GAAAACCTCT GACACATGCA GCTCCCGGAG ACGGTCACAG CTTGTCTGTA 

CTTTTGGAGA CTGTGTACGT CGAGGGCCTC TGCCAGTGTC GAACAGACAT 
1851 AGCGGATGCC GGGAGCAGAC AAGCCCGTCA GGGCGCGTCA GCGGGTGTTG 

TCGCCTACGG CCCTCGTCTG TTCGGGCAGT CCCGCGCAGT CGCCCACAAC 
1901 GCGGGTGTCG GGGCTGGCTT AACTATGCGG CATCAGAGCA GATTGTACTG 

CGCCCACAGC CCCGACCGAA TTGATACGCC GTAGTCTCGT CTAACATGAC 
1951 AGAGTGCACC ATATGCGGTG TGAAATACCG CACAGATGCG TAAGGAGAAA 

TCTCACGTGG TATACGCCAC ACTTTATGGC GTGTCTACGC ATTCCTCTTT 
2001 ATACCGCATC AGGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT 

TATGGCGTAG TCCTGCGCGG GACATCGCCG CGTAATTCGC GCCGCCCACA 
2051 GGTGGTTACG CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG 

CCACCAATGC GCGTCGCACT GGCGATGTGA ACGGTCGCGG GATCGCGGGC 
2101 CTCCTTTCGC TTTCTTCCCT TCCTTTCTCG CCACGrfCGC CGGCTTTCCC 

GAGGAAAGCG AAAGAAGGGA AGGAAAGAGC GGTGCAAGCG GCCGAAAGGG 
2151 CGTCAAGCTC TAAATCGGGG GCTCCCTTTA GGGTTCCGAT TTAGTGCTTT 

GCAGTTCGAG ATTTAGCCCC CGAGGGAAAT CCCAAGGCTA AATCACGAAA 
2201 ACGGCACCTC GACCCCAAAA AACTTGATTA GGGTGATGGT TCACGTAGTG 

TGCCGTGGAG CTGGGGTTTT TTGAACTAAT CCCACTACCA AGTGCATCAC 
2251 GGCCATCGCC CTGATAGACG GTmTCGCC CTTTGACGTT GGAGTCCACG 

CCGGTAGCGG GACTATCTGC CAAAAAGCGG GAAACTGCAA CCTCAGGTGC 
2301 TTCTTTAATA GTGGACTCTT GTTCCAAACT GGAACAACAC TCAACCCTAT 

AAGAAATTAT CACCTGAGAA CAAGGTTTGA CCTTGTTGTG AGTTGGGATA 
2351 CTCGGTCTAT TCTTTTGATT TATAAGGGAT TTTGCCGATT TCGGCCTATT 

GAGCCAGATA AGAAAACTAA ATAnCCCTA AAACGGCTAA AGCCGGATAA 
2401 GGT TAAAAAA TGAGCTGATT TAAC AAAAAT TTAACGCGAA TTTTAACAAA 

CCAAI 1 1 1 1 1 ACTCGACTAA ATTGTTTTTA AATTGCGCTT AAAATTGTTT 
2451 ATATTAACGC TTACAATTTC CATTCGCCAT TCAGGCTGAA CTAGATCTAG 

TATAATTGCG AATGTTAAAG GTAAGCGGTA AGTCCGACTT GATCTAGATC 
2501 AGTCCGTTAC ATAACTTACG GTAAATGGCC CGCCTGGCTG ACCGCCCAAC 

TCAGGCAATG TATTGAATGC CATTTACCGG GCGGACCGAC TGGCGGGTTG 
2551 GACCCCCGCC CATTGACGTC AATAATGACG TATGTTCCCA TAGTMCGCC 

CTGGGGGCGG GTAACTGCAG TTATTACTGC ATACAAGGGT ATCATTGCGG 
2601 AATAGGGACT TTCCATTGAC GTCAATGGGT GGAGTATTTA CGGTAAACTG 

TTATCCCTGA AAGGTAACTG CAGTTACCCA CCTCATAAAT GCCATTTGAC 
2651 CCCACTTGGC AGTACATCAA GTGTATCATA TGCCAAGTAC GCCCCCTATT 

GGGTGAACCG TCATGTAGTT CACATAGTAT ACGGTTCATG CGGGGGATAA 
2701 GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC AGTACATGAC 

CTGCAGTTAC TGCCATTTAC CGGGCGGACC GTAATACGGG TCATGTACTG 
2751 CTTATGGGAC TTTCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA 

GAATACCCTG AAAGGATGAA CCGTCATGTA GATGCATAAT CAGTAGCGAT 
2801 TTACCATGGT GATGCGGTTT TGGCAGTACA TCAATGGGCG TGGATAGCGG 

AATGGTACCA CTACGCCAAA ACCGTCATGT AGTTACCCGC ACCTATCGCC 
2851 TTTGACTCAC GGGGATTTCC AAGTCTCCAC CCCATTGACG TCAATGGGAG 

AAACTGAGTG CCCCTAAAGG TTCAGAGGTG GGGTAACTGC AGTTACCCTC 
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2901 TTTGTTTTGG CACCAAAATC AACGGGACTT TCCAAAATGT C6TAACAACT 

AAACAAAACC GTGGTTTTAG TTGCCCTGAA AGGTTTTACA GCATTGTTGA 
2951 CCGCCCCATT GACGCAAATG GGCGGTAGGC GTGTACGGTG GGAGGTCTAT 

GGCGGGGTAA CTGCGTTTAC CCGCCATCCG CACATGCCAC CCTCCAGATA 
3001 ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA GACGCCATCC 
j TATTCGTCTC GAGCAAATCA CTTGGCAGTC TAGCGGACCT CTGCGGTAGG 
3051 ACGCTGTTTT GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGCG 

TGCGACAAAA CTGGAGGTAT CTTCTGTGGC CCTGGCTAGG TCGGAGGCGC 
3101 GCCGGGAACG GTGCATTGGA ACGGACCGTG TTGACAATTA ATCATCGGCA 

CGGCCCTTGC CACGTAACCT TGCCTGGCAC AACTGTTAAT TAGTAGCCGT 
3151 TAGTATATCG GCATAGTATA ATACGACAAG GTGAGGAACT AAACCATGGC 

ATCATATAGC CGTATCATAT TATGCTGTTC CACTCCTTGA TTTGGTACCG 
3201 CAAGTTGACC AGTGCCGTTC CGGTGCTCAC CGCGCGCGAC GTCGCCGGAG 

GTTCAACTGG TCACGGCAAG GCCACGAGTG GCGCGCGCTG CAGCGGCCTC 
3251 CGGTCGAGTT CTGGACCGAC CGGCTCGGGT TCTCCCGGGA CTTCGTGGAG 

GCCAGCTCAA GACCTGGCTG GCCGAGCCCA AGAGGGCCCT GAAGCACCTC 
3301 GACGACTTCG CCGGTGTGGT CCGGGACGAC GTGACCCTGT TCATCAGCGC 

CTGCTGAAGC GGCCACACCA GGCCCTGCTG CACTGGGACA AGTAGTCGCG 
3351 GGTCCAGGAC CAGGTGGTGC CGGACAACAC CCTGGCCTGG GTGTGGGTGC 

CCAGGTCCTG GTCCACCACG GCCTGTTGTG GGACCGGACC CACACCCACG 
3401 GCGGCCTGGA CGAGCTGTAC GCCGAGTGGT CGGAGGTCGT GTCCACGAAC 

CGCCGGACCT GCTCGACATG CGGCTCACCA GCCTCCAGCA CAGGTGCTTG 
3451 TTCCGGGACG CCTCCGGGCC GGCCATGACC GAGATCGGCG AGCAGCCGTG 

AAGGCCCTGC GGAGGCCCGG CCGGTACTGG CTCTAGCCGC TCGTCGGCAC 
3501 GGGGCGGGAG TTCGCCCTGC GCGACCCGGC CGGCAACTGC GTGCACTTCG 

CCCCGCCCTC AAGCGGGACG CGCTGGGCCG GCCGTTGACG CACGTGAAGC 
3551 TGGCCGAGGA GCAGGACTGA CACTCGACCT CGAAACTTGT TTATTGCAGC 

ACCGGCTCCT CGTCCTGACT GTGAGCTGGA GCTTTGAACA AATAACGTCG 
3601 TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG 

A ATATTACCA ATGTTTATTT CGTTATCGTA GTGTTTAAAG TGTTTATTTC 
3651 CAM I IIIIC ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA 

GTAAAAAAAG TGACGTAAGA TCAACACCAA ACAGGTTTGA GTAGTTACAT 
3701 TCTTATCATG TCTGGATCGA TACTTCAAGA ACTGCTGACA TCGAGCTTGC 

AGAATAGTAC AGACCTAGCT ATGAAGTTCT TGACGACTGT AGCTCGAACG 
3751 TACAAGGGAC TTTCCGCTGG GGACTTTCCA GGGAGGCGTG GCCTGGGCGG 

ATGTTCCCTG AAAGGCGACC CCTGAAAGGT CCCTCCGCAC CGGACCCGCC 
3801 GACTGGGGAG TGGCGAGCCC TCAGATCCTG CATATAAGCA GCTGCTTTTT 

CTGACCCCTC ACCGCTCGGG AGTCTAGGAC GTATATTCGT CGACGAAAAA 
3851 GCCTGTACTG GGTCTCTCTG GTTAGACCAG ATCTGAGCCT GGGAGCTCTC 

CGGACATGAC CCAGAGAGAC CAATCTGGTC TAGACTCGGA CCCTCGAGAG 
3901 TGGCTAACTA GGGAACCCAC TGCTTAAGCC TCAATAAAGC TTGGTACCGA 

ACCGATTGAT CCCTTGGGTG ACGAATTCGG AGTTATTTCG AACCATGGCT 
3951 GCTCGGATCC GAATTCGCCA CC 

CGAGCCTAGG CTTAAGCGGT GG 
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(SEQ ID NO: 21 AND 22) 

1 6ACGGATCGG GAGATCTCCC GATCCCCTAT 6GTCGACTCT CAGTACAATC 

CTGCCTAGCC CTCTAGAGGG CTAGGGGATA CCAGCTGAGA GTCATGTTAG 
51 TGCTCTGATG CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT 

ACGAGACTAC GGCGTATCAA TTCGGTCATA GACGAGGGAC GAACACACAA 
101 GGAGGTCGCT GAGTAGTGCG CGAGCAAAAT TTAAGCTACA ACAAGGCAAG 

CCTCCAGCGA CTCATCACGC GCTCGTTTTA AATTCGATGT TGTTCCGTTC 
151 GCTTGACCGA CAATTGCATG AAGAATCTGC TTAGGGTTAG GCGTTTTGCG 

CGAACTGGCT GTTAACGTAC TTCTTAGACG AATCCCAATC CGCAAAACGC 
201 CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT GATTATTGAC 

GACGAAGCGC TACATGCCCG GTCTATATGC GCAACTGTAA CTAATAACTG 
251 TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 

ATCAATAATT ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT 
301 TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG 

ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC 
351 CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 

GGGTTGCTGG GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA 
401 AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAC TATTTACGGT 

TTGCGGTTAT CCCTGAAAGG TAACTGCAGT TACCCACCTG ATAAATGCCA 
451 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 

TTTGACGGGT GAACCGTCAT GTAGTTCACA TAGTATACGG TTCATGCGGG 
501 CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA 

GGATAACTGC AGTTACTGCC ATTTACCGGG CGGACCGTAA TACGGGTCAT 
551 CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 

GTACTGGAAT ACCCTGAAAG GATGAACCGT CATGTAGATG CATAATCAGT 
601 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA 

AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT ACCCGCACCT 
651 TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 

ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 
701 TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 

ACCCTCAAAC AAAACCGTGG TTTTAGTTGC CCTGAAAGGT TTTACAGCAT 
751 ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 

TGTTGAGGCG GGGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 
801 GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA CTGCTTACTG 

CAGATATATT CGTCTCGAGA GACCGATTGA TCTCTTGGGT GACGAATGAC 
851 GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTGGCTAGC 

CGAATAGCTT TAATTATGCT GAGTGATATC CCTCTGGGTT CGACCGATCG 
901 GTTTAAACTT MGCTTCCCC GCGCAGGGTC GCGATGCTGC CCGGTTTGGC 

CAAATTTGAA TTCGAAGGGG CGCGTCCCAG CGCTACGACG GGCCAAACCG 
951 ACTGCTCCTG CTGGCCGCCT GGACGGCTCG GGCGCTGGAG GTACCCACTG 

TGACGAGGAC GACCGGCGGA CCTGCCGAGC CCGCGACCTC CATGGGTGAC 
1001 ATGGTAATGC TGGCCTGCTG GCTGAACCCC AGATTGCCAT GTTCTGTGGC 

TACCATTACG ACCGGACGAC CGACTTGGGG TCTAACGGTA CAAGACACCG 
1051 AGACTGAACA TGCACATGAA TGTCCAGAAT GGGAAGTGGG ATTCAGATCC 

TCTGACTTGT ACGTGTACTT ACAGGTCTTA CCCTTCACCC TAAGTCTAGG 
1101 ATCAGGGACC AAAACCTGCA TTGATACCAA GGAAGGCATC CTGCAGTATT 

TAGTCCCTGG TTTTGGACGT AACTATGGTT CCTTCCGTAG GACGTCATAA 
1151 GCCAAGAAGT CTACCCTGAA CTGCAGATCA CCAATGTGGT AGAAGCCAAC 

CGGTTCTTCA GATGGGACTT GACGTCTAGT GGTTACACCA TCTTCGGTTG 
1201 CAACCAGTGA CCATCCAGAA CTGGTGCAAG CGGGGCCGCA AGCAGTGCAA 

GTTGGTCACT GGTAGGTCTT GACCACGTTC GCCCCGGCGT TCGTCACGTT 
1251 GACCCATCCC CACTTTGTGA TTCCCTACCG CTGCTTAGTT GGTGAGTTTA 

CTGGGTAGGG GTGAAACACT AAGGGATGGC GACGAATCAA CCACTCAAAT 
1301 TAAGTGATGC CCTTCTCGTT CCTGACAAGT GCAAATTCTT ACACCAGGAG 

ATTCACTACG GGAAGAGCAA GGACTGTTCA CGTTTAAGAA TGTGGTCCTC 
1351 AGGATGGATG TTTGCGAAAC TCATCTTCAC TGGCACACCG TCGCCAAAGA 

TCCTACCTAC AAACGCTTTG AGTAGAAGTG ACCGTGTGGC AGCGGTTTCT 
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1401 GACATGCAGT GA6AAGAGTA CCAACTTGCA TGACTACGGC ATGTTGCTGC 

CTGTACGTCA CTCTTCTCAT GGTTGAACGT ACTGATGCCG TACAACGACG 
1451 CCTGCGGAAT TGACAAGTTC CGAGGGGTAG AGTTTGTGTG TTGCCCACTG 

GGACGCCTTA ACTGTTCAAG GCTCCCCATC TCAAACACAC AACGGGTGAC 
1501 GCTGAAGAAA GTGACAATGT GGATTCTGCT GATGCGGAGG AGGATGACTC 

CGACTTCTTT CACTGTTACA CCTAAGACGA CTACGCCTCC TCCTACTGAG 
1551 GGATGTCTGG TGGGGCGGAG CAGACACAGA CTATGCAGAT GGGAGTGAAG 

CCTACAGACC ACCCCGCCTC GTCTGTGTCT GATACGTCTA CCCTCACTTC 
1601 ACAAAGTAGT AGAAGTAGCA GAGGAGGAAG AAGTGGCTGA GGTGGAAGAA 

TGTTTCATCA TCTTCATCGT CTCCTCCTTC TTCACCGACT CCACCTTCTT 
1551 GAAGAAGCCG ATGATGACGA GGACGATGAG GATGGTGATG AGGTAGAGGA 

CTTCTTCGGC TACTACTGCT CCTGCTACTC CTACCACTAC TCCATCTCCT 
1701 AGAGGCTGAG GAACCCTACG AAGAAGCCAC AGAGAGAACC ACCAGCATTG 

TCTCCGACTC CTTGGGATGC TTCTTCGGTG TCTCTCTTGG TGGTCGTAAC 
1751 CCACCACCAC CACCACCACC ACAGAGTCTG TGGAAGAGGT GGTTCGAGTT 

GGTGGTGGTG GTGGTGGTGG TGTCTCAGAC ACCTTCTCCA CCAAGCTCAA 
1801 CCTACAACAG CAGCCAGTAC CCCTGATGCC GTTGACAAGT ATCTCGAGAC 

GGATGTTGTC GTCGGTCATG GGGACTACGG CAACTGTTCA TAGAGCTCTG 
1851 ACCTGGGGAT GAGAATGAAC ATGCCCATTT CCAGAAAGCC AAAGAGAGGC 

TGGACCCCTA CTCTTACTTG TACGGGTAAA GGTCTTTCGG TTTCTCTCCG 
1901 TTGAGGCCAA GCACCGAGAG AGAATGTCCC AGGTCATGAG AGAATGGGAA 

AACTCCGGTT CGTGGCTCTC TCTTACAGGG TCCAGTACTC TCTTACCCTT 
1951 GAGGCAGAAC GTCAAGCAAA GAACTTGCCT AAAGCTGATA AGAAGGCAGT 

CTCCGTCTTG CAGTTCGTTT CTTGAACGGA TTTCGACTAT TCTTCCGTCA 
2001 TATCCAGCAT TTCCAGGAGA AAGTGGAATC TTTGGAACAG GAAGCAGCCA 

ATAGGTCGTA AAGGTCCTCT TTCACCTTAG AAACCTTGTC CTTCGTCGGT 
2051 ACGAGAGACA GCAGCTGGTG GAGACACACA TGGCCAGAGT GGAAGCCATG 

TGCTCTCTGT CGTCGACCAC CTCTGTGTGT ACCGGTCTCA CCTTCGGTAC 
2101 CTCAATGACC GCCGCCGCCT GGCCCTGGAG AACTACATCA CCGCTCTGCA 

GAGTTACTGG CGGCGGCGGA CCGGGACCTC TTGATGTAGT GGCGAGACGT 
2151 GGCTGTTCCT CCTCGGCCTC GTCACGTGTT CAATATGCTA AAGAAGTATG 

CCGACAAGGA GGAGCCGGAG CAGTGCACAA GTTATACGAT TTCTTCATAC 
2201 TCCGCGCAGA ACAGAAGGAC AGACAGCACA CCCTAAAGCA TTTCGAGCAT 

AGGCGCGTCT TGTCTTCCTG TCTGTCGTGT GGGATTTCGT AAAGCTCGTA 
2251 GTGCGCATGG TGGATCCCAA GAAAGCCGCT CA6ATCCGGT CCCAGGTTAT 

CACGCGTACC ACCTAGGGTT CTTTCGGCGA GTCTAGGCCA GGGTCCAATA 
2301 GACACACCTC CGTGTGATTT ATGAGCGCAT GAATCAGTCT CTCTCCCTGC 

CTGTGTGGAG GCACACTAAA TACTCGCGTA CTTAGTCAGA GAGAGGGACG 
2351 TCTACAACGT GCCTGCAGTG GCCGAGGAGA TTCAGGATGA AGTTGATGAG 
• AGATGTTGCA CGGACGTCAC CGGCTCCTCT AAGTCCTACT TCAACTACTC 
2401 CTGCTTCAGA AAGAGCAAAA CTATTCAGAT GACGTCTTGG CCAACATGAT 

GACGAAGTCT TTCTCGTTTT GATAAGTCTA CTGCAGAACC GGTTGTACTA 
2451 TAGTGAACCA AGGATCAGTT ACGGAAACGA TGCTCTCATG CCATCTTTGA 

ATCACTTGGT TCCTAGTCAA TGCCTTTGCT ACGAGAGTAC GGTAGAAACT 
2501 CCGAAACGAA AACCACCGTG GAGCTCCTTC CCGTGAATGG AGAGTTCAGC 

GGCTTTGCTT TTGGTGGCAC CTCGAGGAAG GGCACTTACC TCTCAAGTCG 
2551 CTGGACGATC TCCAGCCGTG GCATTCTTTT GGGGCTGACT CTGTGCCAGC 

GACCTGCTAG AGGTCGGCAC CGTAAGAAAA CCCCGACTGA GACACGGTCG 
2601 CAACACAGAA AACGAAGTTG AGCCTGTTGA TGCCCGCCCT GCTGCCGACC 

GTTGTGTCTT TTGCTTCAAC TCGGACAACT ACGGGCGGGA CGACGGCTGG 
2651 GAGGACTGAC CACTCGACCA GGTTCTGGGT TGACAAATAT CAAGACGGAG 

CTCCTGACTG GTGAGCTGGT CCAAGACCCA ACTGTTTATA GTTCTGCCTC 
2701 GAGATCTCTG AAGTGAATCT AGATGCAGAA TTCCGACATG ACTCAGGATA 

CTCTAGAGAC TTCACTTAGA TCTACGTCTT AAGGCTGTAC TGAGTCCTAT 
2751 TGAAGTTCAT CATCAAAAAT TGGTGTTCTT TGCAGAAGAT GTGGGTTCAA 

ACTTCAAGTA GTAGTTTTTA ACCACAAGAA ACGTCTTCTA CACCCAAGTT 
2801 ACAAAGGTGC AATCATTGGA CTCATGGTGG GCGGTGTTGT CATAGCGACA 

TGTTTCCACG TTAGTAACCT GAGTACCACC CGCCACAACA GTATCGCTGT 
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2851 GTGATCGTCA TCACCTTGGT 

CACTAGCAGT AGTGGAACCA 
2901 AGTAGATCCT AGACTAGAGC 

TCATCTAGGA TCTGATCTCG 
2951 CTGCTTGTAC CAATTGCTAT 

GACGAACATG GTTAACGATA 
3001 TGTTTCATGA CAAAAGCCTT 

ACAAAGTACT GTTTTCGGAA 
3051 ACAGCGACGA AGAGCTCATC 

TGTCGCTGCT TCTCGAGTAG 
3101 CAAAGCAGTA AGTAGGCGGC 

GTTTCGTCAT TCATCCGCCG 
3151 GCTGATCAGC CTCGACTGTG 

CGACTAGTCG GAGCTGACAC 
3201 CCCTCCCCCG TGCCTTCCTT 

GGGAGGGGGC ACGGAAGGAA 
3251 TTCCTAATAA AATGAGGAAA 

AAGGATTATT TTACTCCTTT 
3301 CTATTCTGGG GGGTGGGGTG 

GATAAGACCC CCCACCCCAC 
3351 GACMTAGCA GGCATGCTGG 

CTGTTATCGT CCGTACGACC 
3401 GGAAAGAACC AGCTGGGGCT 

CCTTTCTTGG TCGACCCCGA 
3451 GCGCATTAAG CGCGGCGGGT 

CGCGTAATTC GCGCCGCCCA 
3501 CTTGCCAGCG CCCTAGCGCC 

GAACGGTCGC GGGATCGCGG 
3551 CGCCACGTTC GCCGGCTTTC 

GCGGTGCAAG CGGCCGAAAG 
3601 TAGGGTTCCG ATTTAGTGCT 

ATCCCAAGGC TAMTCACGA 
3651 TAGGGTGATG GTTCACGTAG 

ATCCCACTAC CAAGTGCATC 
3701 CCCTTTGACG TTGGAGTCCA 

GGGAAACTGC AACCTCAGGT 
3751 CTGGAACAAC ACTCAACCCT 

GACCTTGTTG TGAGTTGGGA 
3801 A7TTTGGGGA TTTCGGCCTA 

TAAAACCCCT AAAGCCGGAT 
3851 ATTTAACGCG AATTAATTCT 

TAAATTGCGC TTAATTAAGA 
3901 GTCCCCAGGC TCCCCAGGCA 

CAGGGGTCCG AGGGGTCCGT 
3951 TAGTCAGCAA CCAGGTGTGG 

ATCAGTCGTT GGTCCACACC 
4001 TATGCAAAGC ATGCATCTCA 

ATACGTTTCG TACGTAGAGT 
4051 CTCCGCCCAT CCCGCCCCTA 

GAGGCGGGTA GG GCGGGG AT 
4101 CATGGCTGAC TAAI 1 1 1 1 1 1 

GTACCGACTG ATTAAAAAAA 

4151 ctctgagcta ttccagaagt 
gagactcgat aaggtcttca 

4201 tttgc aaaaa gctcccggga 
aaacghttt cgagggccct 
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GATGCTGAAG aagaaagata tcatggagcc 

CTACGACTTC TTCTTTCTAT AGTACCTCGG 
CCTGGAAGCA TCCAGGAAGT CAGCCTAAAA 
GGACCTTCGT AGGTCCTTCA GTCGGATTTT 
TG TAAAAA GT GTTGCTTTCA TTGCCAAGTT 
ACATTTTTCA CAACGAAAGT AACGGTTCAA 
AGGCATCTCC TATGGCAGGA AGAAGCGGAG 
TCCGTAGAGG ATACCGTCCT TCTTCGCCTC 
AGAACAGTCA GACTCATCAA GCTTCTCTAT 
TC7TGTCAGT CTGAGTAGTT CGAAGAGATA 
CGCTCGAGTC TAGAGGGCCC GTTTAAACCC 
GCGAGCTCAG ATCTCCCGGG CAAATTTGGG 
CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 
GGAAGATCAA CGGTCGGTAG ACAACAAACG 
GACCCTGGAA GGTGCCACTC CCACTGTCCT 
CTGGGACCTT CCACGGTGAG GGTGACAGGA 
TTGCATCGCA TTGTCTGAGT AGGTGTCATT 
AACGTAGCGT AACAGACTCA TCCACAGTAA 
GGGCAGGACA GCAAGGGGGA GGATTGGGAA 
CCCGTCCTGT CGTTCCCCCT CCTAACCbTT 
GGATGCGGTG GGCTCTATGG CTTCTGAGGC 
CCTACGCCAC CCGAGATACC GAAGACTCCG 
CTAGGGGGTA TCCCCACGCG CCCTGTAGCG 
GATCCCCCAT AGGGGTGCGC GGGACATCGC 
GTGGTGGTTA CGCGCAGCGT GACCGCTACA 
CACCACCAAT GCGCGTCGCA CTGGCGATGT 
CGCTCCTTTC GCTTTCTTCC CTTCCTTTCT 
GCGAGGAAAG CGAAAGAAGG GAAGGAAAGA 
CCCGTCAAGC TCTAAATCGG GGCATCCCTT 
GGGCAGTTCG AGATTTAGCC CCGTAGGGAA 
TTACGGCACC TCGACCCCAA AAAACTTGAT 
AATGCCGTGG AGCTGGGGTT TTTTGAACTA 
TGGGCCATCG CCCTGATAGA CGGTTTTTCG 
ACCCGGTAGC GGGACTATCT GCCAAAAAGC 
CGTTCTTTAA TAGTGGACTC TTGTTCCAAA 
GCAAGAAATT ATCACCTGAG AACAAGGTTT 
ATCTCGGTCT ATTCTTTTGA TTTATAAGGG 
TAGAGCCAGA TAAGAAAACT AAATATTCCC 
TTGGTTAAAA AATGAGCTGA TTTAACAAAA 
AACCAATTTT TTACTCGACT AAATTGTTTT 
GTGGAATGTG TGTCAGTTAG GGTGTGGAAA 
CACCTTACAC ACAGTCAATC CCACACCTTT 
GGCAGAAGTA TGCAAAGCAT GCATCTCAAT 
CCGTCTTCAT ACGTTTCGTA CGTAGAGTTA 
AAAGTCCCCA GGCTCCCCAG CAGGCAGAAG 
TTTCAGGGGT CCGAGGGGTC GTCCGTCTTC 
ATTAGTCAGC AACCATAGTC CCGCCCCTAA 
TAATCAGTCG TTGGTATCAG GGCGGGGATT 
ACTCCGCCCA GTTCCGCCCA TTCTCCGCCC 
TGAGGCGGGT CAAGGCGGGT AAGAGGCGGG 
TATTTATGCA GAGGCCGAGG CCGCCTCTGC 
ATAAATACGT C TCCGGCT CC GGCGGAGACG 
AGTGAGGAGG CTTTTTTGGA GGCCTAGGCT 
TCACTCCTCC GAAAAAACCT CCGGATCCGA 
GCTTGTATAT CCATTTTCGG ATCTGATCAG 
CGAACATATA GGTAAAAGCC TAGACTAGTC 
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4251 CACGTGTTGA CAATTAATCA TCGGCATAGT ATATCGGCAT AGTATAATAC 

GTGCACAACT GTTAATTAGT AGCCGTATCA TATAGCCGTA TCATATTATG 
4301 GACAAGGTGA GGAACTAAAC CATGGCCAAG TTGACCAGTG CCGTTCCGGT 

CTGTTCCACT CC7TGATTTG GTACCGGTTC AACTGGTCAC GGCAAGGCCA 
4351 GCTCACCGCG CGCGACGTCG CCGGAGCGGT CGAGTTCTGG ACCGACCGGC 

CGAGTGGCGC GCGCTGCAGC GGCCTCGCCA GCTCAAGACC TGGCTGGCCG 
4401 TCGGG7TCTC CCGGGACTTC GTGGAGGACG ACTTCGCCGG TGTGGTCCGG 

AGCCCAAGAG GGCCCTGAAG CACCTCCTGC TGAAGCGGCC ACACCAGGCC 
4451 GACGACGTGA CCCTGTTCAT CAGCGCGGTC CAGGACCAGG TGGTGCCGGA 

CTGCTGCACT GGGACAAGTA GTCGCGCCAG GTCCTGGTCC ACCACGGCCT 
4501 CAACACCCTG GCCTGGGTGT GGGTGCGCGG CCTGGACGAG CTGTACGCCG 

GTTGTGGGAC CGGACCCACA CCCACGCGCC GGACCTGCTC GACATGCGGC 
4551 AGTGGTCGGA GGTCGTGTCC ACGAACTTCC GGGACGCCTC CGGGCCGGCC 

TCACCAGCCT CCAGCACAGG TGCTTGAAGG CCCTGCGGAG GCCCGGCCGG 
4601 ATGACCGAGA TCGGCGAGCA GCCGTGGGGG CGGGAGTTCG CCCTGCGCGA 

TACTGGCTCT AGCCGCTCGT CGGCACCCCC GCCCTCAAGC GGGACGCGCT 
4651 CCCGGCCGGC AACTGCGTGC ACTTCGTGGC CGAGGAGCAG GACTGACACG 

GGGCCGGCCG TTGACGCACG TGAAGCACCG GCTCCTCGTC CTGACTGTGC 
4701 TGCTACGAGA TTTCGATTCC ACCGCCGCCT TCTATGAAAG GTTGGGCTTC 

ACGATGCTCT AAAGCTAAGG TGGCGGCGGA AGATACTTTC CAACCCGAAG 
4751 GGAATCGTTT TCCGGGACGC CGGCTGGATG ATCCTCCAGC GCGGGGATCT 

CCTTAGCAAA AGGCCCTGCG GCCGACCTAC TAGGAGGTCG CGCCCCTAGA 
4801 CATGCTGGAG TTCTTCGCCC ACCCCAACTT GTTTATTGCA GCTTATAATG 

GTACGACCTC AAGAAGCGGG TGGGGTTGAA CAAATAACGT CGAATATTAC 
4851 GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA AGCAI 1 1 1 1 1 

CAATGTTTAT TTCGTTATCG TAGTGTTTAA AGTGTTTATT TCGTAAAAAA 
4901 TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATG TATCTTATCA 

AGTGACGTAA GATCAACACC AAACAGGTTT GAGTAGTTAC ATAGAATAGT 
4951 TGTCTGTATA CCGTCGACCT CTAGCTAGAG CTTGGCGTAA TCATGGTCAT 

ACAGACATAT GGCAGCTGGA GATCGATCTC GAACCGCATT AGTACCAGTA 
5001 AGCTGTTTCC TGTGTGAAAT TGTTATCCGC TCACAATTCC ACACAACATA 

TCGACAAAGG ACACACTTTA ACAATAGGCG AGTGTTAAGG TGTGTTGTAT 
5051 CGAGCCGGAA GCATAAAGTG TAAAGCCTGG GGTGCCTAAT GAGTGAGCTA 

GCTCGGCCTT CGTATTTCAC ATTTCGGACC CCACGGATTA CTCACTCGAT 
5101 ACTCACATTA ATTGCGTTGC GCTCACTGCC CGCTTTCCAG TCGGGAAACC 

TGAGTGTAAT TAACGCAACG CGAGTGACGG GCGAAAGGTC AGCCCTTTGG 
5151 TGTCGTGCCA GCTGCATTAA TGAATCGGCC AACGCGCGGG GAGAGGCGGT 

ACAGCACGGT CGACGTAATT ACTTAGCCGG TTGCGCGCCC CTCTCCGCCA 
5201 TTGCGTATTG GGCGCTCTTC CGCTTCCTCG CTCACTGACT CGCTGCGCTC 

AACGCATAAC CCGCGAGAAG GCGAAGGAGC GAGTGACTGA GCGACGCGAG 
5251 GGTCGTTCGG CTGCGGCGAG CGGTATGAGC TCACTCAAAG GCGGTAATAC 

CCAGCAAGCC GACGCCGCTC GCCATAGTCG AGTGAGTTTC CGCCATTATG 
5301 GGTTATCCAC AGAATCAGGG GATAACGCAG GAAAGAACAT GTGAGCAAAA 

CCAATAGGTG TCTTAGTCCC CTATTGCGTC CTTTCTTGTA CACTCGTTTT 
5351 GGCCAGCAAA AGGCCAGGAA CCGTAAAAAG GCCGCGTTGC TGGCGTTTTT 

CCGGTCGTTT TCCGGTCCTT GGCATTTTTC CGGCGCAACG ACCGCAAAAA 
5401 CCATAGGCTC CGCCCCCCTG ACGAGCATCA CAAAAATCGA CGCTCAAGTC 

GGTATCCGAG GCGGGGGGAC TGCTCGTAGT GTTTTTAGCT GCGAGTTCAG 
5451 AGAGGTGGCG AAACCCGACA GGACTATAAA GATACCAGGC GTTTCCCCCT 

TCTCCACCGC TTTGGGCTGT CCTGATATTT CTATGGTCCG CAAAGGGGGA 
5501 GGAAGCTCCC TCGTGCGCTC TCCTGTTCCG ACCCTGCCGC TTACCGGATA 

CCTTCGAGGG AGCACGCGAG AGGACAAGGC TGGGACGGCG AATGGCCTAT 
5551 CCTGTCCGCC TTTCTCCCTT CGGGAAGCGT GGCGCTTTCT CAATGCTCAC 

GGACAGGCGG AAAGAGGGAA GCCCTTCGCA CCGCGAAAGA GTTACGAGTG 
5601 GCTGTAGGTA TCTCAGTTCG GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT 

CGACATCCAT AGAGTCAAGC CACATCCAGC AAGCGAGGTT CGACCCGACA 
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5651 GTGCACGAAC CCCCCGTTCA GCCCGACCGC TGCGCCTTAT CCGGTAACTA 

CACGTGCTTG GGGGGCAAGT CGGGCTGGCG ACGCGGAATA GGCCATTGAT 
5701 TCGTCTTGAG TCCAACCCGG TAAGACACGA CTTATCGCCA CTGGCAGCAG 

AGCAGAACTC AGGTTGGGCC ATTCTGTGCT GAATAGCGGT GACCGTCGTC 
5751 CCACTGGTAA CAGGATTAGC AGAGCGAGGT ATGTAGGCGG TGCTACAGAG 

GGTGACCATT GTCCTAATCG TCTCGCTCCA TACATCCGCC ACGATGTCTC 
5801 TTCTTGAAGT GGTGGCCTAA CTACGGCTAC ACTAGAAGGA CAGTATTTGG 

AAGAACTTCA CCACCGGATT GATGCCGATG TGATCTTCCT GTCATAAACC 
5851 TATCTGCGCT CTGCTGAAGC CAGTTACCTT CGGAAAAAGA GTTGGTAGCT 

ATAGACGCGA GACGACTTCG GTCAATGGAA GCCTTTTTCT CAACCATCGA 
5901 CTTGATCCGG CAAACAAACC ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC 

GAACTAGGCC GTTTGTTTGG TGGCGACCAT CGCCACCAAA AAAACAAACG 
5951 AAGCAGCAGA TTACGCGCAG AAAAAAA GGA TCTCAAGAAG ATCCTTTGAT 

TTCGTCGTCT AATGCGCGTC 1 1 1 1 1 1 1 CCX AGAGTTCTTC TAGGAAACTA 
6001 CTTTTCTACG GGGTCTGACG CTCAGTGGAA CGAAAACTCA CGTTAAGGGA 

GAAAAGATGC CCCAGACTGC GAGTCACCTT GCTTTTGAGT GCAATTCCCT 
6051 TTTTGGTCAT GAGATTATCA AAAAGGATCT TCACCTAGAT CCTTTTAAAT 

AAAACCAGTA CTCTAATAGT TTTTCCTAGA AGTGGATCTA GGAAAATTTA 
6101 TAAAAAT GAA GTTTTAAATC AATCTAAAGT ATATATGAGT AAACTTGGTC 

ATTTTTACTT CAAAATTTAG TTAGATTTCA TATATACTCA TTTGAACCAG 
6151 TGACAGTTAC CAATGCTTAA TCAGTGAGGC ACCTATCTCA GCGATCTGTC 

ACTGTCAATG GTTACGAATT AGTCACTCCG TGGATAGAGT CGCTAGACAG 
6201 TATTTCGTTC ATCCATAGTT GCCTGACTCC CCGTCGTGTA GATAACTACG 

ATAAAGCAAG TAGGTATCAA CGGACTGAGG GGCAGCACAT CTATTGATGC 
6251 ATACGGGAGG GCTTACCATC TGGCCCCAGT GCTGCAATGA TACCGCGAGA 

TATGCCCTCC CGAATGGTAG ACCGGGGTCA CGACGTTACT ATGGCGCTCT 
6301 CCCACGCTCA CCGGCTCCAG ATTTATCAGC AATAAACCAG CCAGCCGGAA 

GGGTGCGAGT GGCCGAGGTC TAAATAGTCG TTATTTGGTC GGTCGGCCTT 
6351 GGGCCGAGCG CAGAAGTGGT CCTGCAACTT TATCCGCCTC CATCCAGTCT 

CCCGGCTCGC GTCTTCACCA GGACGTTGAA ATAGGCGGAG GTAGGTCAGA 
6401 ATTAATTGTT GCCGGGAAGC TAGAGTAAGT AGTTCGCCAG TTAATAGTTT 

TAATTAACAA CGGCCCTTCG ATCTCATTCA TCAAGCGGTC AATTATCAAA 
6451 GCGCAACGTT GTTGCCATTG CTACAGGCAT CGTGGTGTCA CGCTCGTCGT 

CGCGTTGCAA CAACGGTAAC GATGTCCGTA GCACCACAGT GCGAGCAGCA 
6501 TTGGTATGGC TTCATTCAGC TCCGGTTCCC AACGATCAAG GCGAGTTACA 

AACCATACCG AAGTAAGTCG AGGCCAAGGG TTGCTAGTTC CGCTCAATGT 
6551 TGATCCCCCA TGTTGTGCAA AAAAGCGGTT AGCTCCTTCG GTCCTCCGAT 

ACTAGGGGGT ACAACACGTT TTTTCGCCAA TCGAGGAAGC CAGGAGGCTA 
6601 CGTTGTCA6A AGTAAGTTGG CCGCAGTGTT ATCACTCATG GTTATGGCAG 

GCAACAGTCT TCATTCAACC GGCGTCACAA TAGTGAGTAC CAATACCGTC 
6651 CACTGCATAA TTCTCTTACT GTCATGCCAT CCGTAAGATG CTTTTCTGTG 

GTGACGTATT AAGAGAATGA CAGTACGGTA GGCATTCTAC GAAAAGACAC 
6701 ACTGGTGAGT ACTCAACCAA GTCATTCTGA GAATAGTGTA TGCGGCGACC 

TGACCACTCA TGAGTTGGTT CAGTAAGACT CTTATCACAT ACGCCGCTGG 
6751 GAGTTGCTCT TGCCCGGCGT CAATACGGGA TAATACCGCG CCACATAGCA 

CTCAACGAGA ACGGGCCGCA GTTATGCCCT ATTATGGCGC GGTGTATCGT 
6801 GAACTTTAAA AGTGCTCATC ATrGGAAAAC GTTCTTCGGG GCGAAAACTC 

CTTGAAATTT TCACGAGTAG TAACCTTTTG CAAGAAGCCC CGCTTTTGAG 
6851 TCAAGGATCT TACCGCTGTT GAGATCCAGT TCGATGTMC CCACTCGTGC 

AGTTCCTAGA ATGGCGACAA CTCTAGGTCA AGCTACATTG GGTGAGCACG 
6901 ACCCMCTGA TCTTCAGCAT CTTTTACTTT CACCAGCGTT TCTGGGTGAG 

TGGGTTGACT AGAAGTCGTA GAAAATGAAA GTGGTCGCAA AGACCCACTC 
6951 CAAAAA CAGG AAGGCAAAAT GCCGCAAAAA AGGGAATAAG GGCGACACGG 

GTTTTTGTCC TTCCGTTTTA CGGCGTTTTT TCCCTTATTC CCGCTGTGCC 
7001 AAATGTTGAA TACTCATACT CTTCCTTTTT CAATATTATT GAAGCATTTA 

TTTACAACTT ATGAGTATGA GAAGGAAAAA GTTATAATAA CTTCGTAAAT 
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7051 TCAGGGTTAT TGTCTCATGA GCGGATACAT ATTTGAATGT ATTTAGAAAA 
AGTCCCAATA ACAGAGTACT CGCCTATGTA TAAACTTACA TAAATCTTTT 

7101 ATAAACAAAT AGGGGTTCCG CGCACATTTC CCCGAAAAGT GCCACCTGAC 
TATTTGTTTA TCCCCAAGGC GCGTGTAAAG GGGCTTTTCA CGGTGGACTG 

7151 GTC 
CAG 
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DNA sequence of APP(1-651)NFEV. K612V-TATexonl (MIL) APP (664-695) 





Tn Kin. ^ 
1U NU: 








1 


A 1 bL 1 GLLCb 


P'l 1 TPPP A PT 

Gl 1 IGGCACT 


PPTPPTPPTP 

GCTCCTGCTG 


PPPPPPTTPP A OOO0"T*00OO0 

GCCGCCTGGA CGGCTCGGGC 


51 


PPTPP A PPT A 

GCTGGAGGTA 


PPP A PTP ATP 

CCCACTGATG 


OTA ATPPTPP 

GTAATGCTGG 


0P*T"PPT~O/"»/"» , T" ft a a p * 

CCTGCTGGCT GAACCCCAGA 


101 


T — rPPP A TPTT 

TTGCCATGTT 


PTr>TAr> O A P A 

CTGTGGCAGA 


OTP A A P A TPP 

CTGAACATGC 


APA "TV* A a TOT ft a a "r~r%<^/> 

ACATGAATGT CCAGAATGGG 


151 


AAGTGGGATT 


PAPA TPP A TP 

CAGATCCATC 


A OOO A PP AAA 

AGGGACCAAA 


A Pf>T"PP A "1* I /*\ *Ta n/M> A /"\ /~\ A 

ACCTGCATTG ATACCAAGGA 


201 


a ooo a tpptp 

AGGCATCCTG 


P APTATTPPO 

CAGTATTGCC 


A A O A A PTPTA 

AAGAAGTCTA 


000*^0 A A ^Trt P.* f\ ft *T*^> ft ft 

CCCTGAACTG CAGATCACCA 


251 


A TPTPPTAP A 

ATGTGGTAGA 


A PPP A APP A A 

AGCCAACCAA 


OP A PTP A PPA 

CCAGTGACCA 


TPP A P A A PTP OTPn A A 0/*«Or> 

TCCAGAACTG GTGCAAGCGG 


301 


prT'POP a Apr* 

GbLLbCAAGC 


A PTPP A A P A P 

Ab IGCAAGAC 


PP ATPPPP AP 

CCATCCCCAC 


'III 'PTP A 1 1 P PPTAPPPPTP 

TTTGTGATTC CCTACCGCTG 


obi 


PTTA PTTPPT 

CTTAGTTGGT 


p APT 1TATA A 

GAGTTTATAA 


PTP A TPP PPT 

GTGATGCCCT 


TPTPPTTPPT P A P A A PTPP A 

TCTCGTTCCT GACAAGTGCA 


401 


A ATTPTTA P A 

AAlTCTTACA 


PP A PP A P A PP 

CCAGGAGAGG 


A TOO A TP 1 1 1 

ATGGATGTTT 


OOP AAA P-frx ft i i s\ ft /-**T*/*irt 

GCGAAACTCA TCTTCACTGG 


451 


papa r k f > r k Tf > r > 

LALALLbl Lb 


PP AAAPAPAP 

CCAAAGAGAC 


A TPP A PTP A P 

ATGCAGTGAG 


A APAPTAPPA A O'l TPP A TP A 

AAGAGTACCA ACTTGCATGA 


501 


PTAPPPPATP 

CTALbGCATG 


TTPPTPP P PT 

TTGCTGCCCT 


r*f*r*r*h a i rp a 

GCGGAATTGA 


OAAPTTPPPA PPPPTAPAPT 

CAAGTTCCGA GGGGTAGAGT 


bbl 


TTPTPTPTTP 

1 Iblb Ibl lb 


PPP A PTPPPT 

LLLAL 1 bbL 1 


PA APA A APTP 

bAAbAAAblb 


A P A A TPTPP A TTPTPPTP A T 

ACAATGTGGA ttctgctgat 


oul 


bLbbAbbAbb 


ATPAPTPPPA 

A 1 uAL 1 LbbA 


TPTPTPPTPP 

J b 1 L 1 bb 1 bb 


f^onr^f^Af^f^Ar* apapapapta 

GbLGGAGCAG ACACAGACTA 


DOl 


TPP A P ATPPP 

1 bLAbA 1 bbb 


A PTP A APAPA 

Ab 1 bAAbALA 


A APTAPTAPA 

AAblAblAbA 


APTAPPAPAP pappaapaap 

AGTAGCAGAG GAGGAAGAAG 


/Ul 


TPPPTP A PPT 

1 bbL 1 bAbb I 


PPA APA APA A 

bbAAbAAbAA 


PA A PPPP A TP 

bAAbLLbA 1 b 


A TP A PPA PPA PPA TP APP AT 

AIGALGAGGA CGATGAGGAT 


/bl 


PPTP ATP A PP 

bb 1 bA 1 bAbb 


TAPAPPA AP A 

1 AbAbbAAbA 


f*r*nr > a pp a a 
bbL 1 bAbbAA 


PPPTAPPAAP AAPPPAPAPA 

LLLTACGAAG AAGCCACAGA 


QA1 


PAPA APPAPP 

bAbAALLALL 


APPATTPPPA 

AbLAI IbLLA 


PPAPPAPPAP 

LLALLALLAL 


P APP APP APA PAPTPTPTPP 

CACCACCACA GAGTCTGTGG 




A APAPPTPPT 
AAbAbb 1 bb 1 


TPP A PTTPPT 

ILrbAbl ILLI 


APAAPAPPAP 

ALAALAbLAb 


LLAGIALLCC TGATGCCGTT 


am 


PAPA APTATP 
bALAAb 1 A 1 L 


TPPAPAPAPP 

1 LbAbALALL 


TPPPPATPAP 

1 bbbbA 1 bAb 


AA 1 GAALA 1 G CLCATI 1 CCA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAA.G TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC ACGTGTTCAA 


1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA AGCCGCTCAG 
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1351 ATCCGGTCCC AGGTTATGAC ACACCTCCGT GTGATTTATG AGCGCATGAA 
1401 TCAGTCTCTC TCCCTGCTCT ACAACGTGCC TGCAGTGGCC GAGGAGATTC 
1451 AGGATGAAGT TGATGAGCTG CTTCAGAAAG AGCAAAACTA TTCAGATGAC 
1501 GTCTTGGCCA ACATGATTAG TGAACCAAGG ATCAGTTACG GAAACGATGC 
1551 TCTCATGCCA TCTTTGACCG AAACGAAAAC CACCGTGGAG CTCCTTCCCG 
1601 TGAATGGAGA GTTCAGCCTG GACGATCTCC AGCCGTGGCA TTCTTTTGGG 
1651 GCTGACTCTG TGCCAGCCAA CACAGAAAAC GAAGTTGAGC CTGTTGATGC 
1701CCGCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGGGTTGA 
1751 CAAATATCAA GACGGAGGAG ATCTCTGAAG TGAACTTTGA AGTGGAATTC 
1801 CGACATGACT CAGGATATGA AGTTCATCAT CAAGTATTGG TGTTCTTTGC 
1851 AGAAGATGTG GGTTCAMCA AAGGTGCAAT CATTGGACTC ATGGTGGGCG 
1901 GTGTTGTCAT AGCGACAGTG ATCGTCATCA CCTTGGTGAT GCTGAAGAAG 
1951 AAAAAGCTTG GTACCGAGCT CGGATCCACT AGTCCAGTGT GGTGGAATTC 
2001 TGCAGATATC CTGGAGCCAG TAGATCCTAG ACTAGAGCCC TGGAAGCATC 
2051 CAGGAAGTCA GCCTAAAACT GCTTGTACCA ATTGCTATTG TAAAAAGTGT 
2101 TGCTTTCATT GCCAAGTTTG TTTCATGACA AAAGCCTTAG GCATCTCCTA 
2151 TGGCAGGAAG AAGCGGAGAC AGCGACGAAG AGCTCATCAG AACAGTCAGA 
2201 CTGATCAAGC TTCTCTATCA AAGCAGAGGA TATCCAGCAC AGTGGCGGCC 
2251 GCAGACGCCG CTGTCACCCC AGAGGAGCGC CACCTGTCCA AGATGCAGCA 
2301 GAACGGCTAC GAAAATCCAA CCTACAAGTT CTTTGAGCAG ATGCAGAACT 
2351 AG 
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(SEQ ID NO: 24) 

Amino acid sequence of APP( 1-651) NFEV, K612V-TATexonI(MlL) APP (664-695) 

ml pgl al 111 aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 

evypel qi tnvveanqpvti qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcet hi h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahfqkakerl eakhrermsqvmreweeaerqaknl pkadkkavi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrri al enyi tal qavpprprhvf nml kky vraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si lynvpavaeeiqdevdell qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl ttrpgsgl tni kteei sev 
2 3 4 5 

nfeyef rhdsgyevhhqvl vf f aedvgsnkgai i gl mvqg vviatv i vi tl vml kkk kl gtel gstspvwwns 

6 

ddtl epvdprl epwkhpgsqpktactncyckkccf hcqvcfmtkal gi s ygrkkprarrrahgnsgthgasl ska 

7 8 
ri sstvaaa daavtpeerhl skmqqngyenptvkf f eqman 
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DNA sequence of APP(1-651)NFEV, K612V-GAL4VP16(de1Met) APP (664-695) 



(SEQ 


ID NO: 25) 










1 


ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGGTA 


CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGTGGCAGA 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGTTTATAA 


GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG 


ATGGATGI 1 1 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 



FIG.30A 



WO 03/072041 



PCT/US03/05458 



59/76 



1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 


1351 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCCGT 


GTGATTTATG 


AGCGCATGAA 


1401 


TCAGTCTCTC 


TCCCTGCTCT 


ACAACGTGCC 


TGCAGTGGCC 


GAGGAGATTC 


1451 


AG6ATGAAGT 


TGATGAGCTG 


CTTCAGAAAG 


AGCAAAACTA 


TTCAGATGAC 


1501 


GTCTTGGCCA 


ACATGATTAG 


TGAACCAAGG 


ATCAGTTACG 


GAAACGATGC 


1551 


TCTCATGCCA 


TCTTTGACCG 


AAACGAAAAC 


CACCGTGGAG 


CTCCTTCCCG 


1601 


TGAATGGAGA 


GTTCAGCCTG 


GACGATCTCC 


AGCCGTGGCA 


TTCTTTTGGG 


1651 


GCTGACTCTG 


TGCCAGCCAA 


CACAGAAAAC 


GAAGTTGAGC 


CTGTTGATGC 


1701 


CCGCCCTGCT 


GCCGACCGAG 


GACTGACCAC 


TCGACCAGGT 


TCTGGGTTGA 


1751 


CAAATATCAA 


GACGGAGGAG 


ATCTCTGAAG 


TGAACTTTGA 


AGTGGAATTC 


1801 


CGACATGACT 


CAGGATATGA 


AGTTCATCAT 


CAAGTATTGG 


TGTTCTTTGC 


1851 


AGAAGATGTG 


GGTTCAAACA 


AAGGTGCAAT 


CATTGGACTC 


ATGGTGGGCG 


1901 


GTGTTGTCAT 


AGCGACAGTG 


ATCGTCATCA 


CCTTGGTGAT 


GCTGAAGAAG 


1951 


AAAAAGCTTG 


GTACCGAGCT 


CGGATCCACT 


AGTCCAGTGT 


GGTGGAATTC 


2001 


TGCAGATATC 


AAGCTACTGT 


CTTCTATCGA 


ACAAGCATGC 


GATATTTGCC 


2051 


GACTTAAAAA 


GCTCAAGTGC 


TCCAAAGAAA 


AACCGAAGTG 


CGCCAAGTGT 


2101 


CTGAAGAACA 


ACTGGGAGTG 


TCGCTACTCT 


CCCAAAACCA 


AAAGGTCTCC 


2151 


GCTGACTAGG 


GCACATCTGA 


CAGAAGTGGA 


ATCAAGGCTA 


GAAAGACTGG 


2201 


AACAGCTATT 


TCTACTGATT 


TTTCCTCGAG 


AAGACCTTGA 


CATGAIIIIG 


2251 


AAAATGGATT 


CTTTACAGGA 


TATAAAAGCA 


TTGTTAACAG 


GATTATTTGT 


2301 


ACAAGATAAT 


GTGAATAAAG 


ATGCCGTCAC 


AGATAGATTG 


GCTTCAGTGG 


2351 


AGACTGATAT 


GCCTCTAACA 


TTGAGACAGC 


ATAGAATAAG 


TGCGACATCA 


2401 


TCATCGGAAG 


AGAGTAGTAA 


CAAAGGTCAA 


AGACAGTTGA 


CTGTATCGGG 


2451 


AATTCCCGGG 


GATCTGGCCC 


CCCCGACCGA 


TGTCAGCCTG 


GGGGACGAGC 


2501 


TCCACTTAGA 


CGGCGAGGAC 


GTGGCGATGG 


CGCATGCCGA 


CGCGCTAGAC 


2551 


GATTTCGATC 


TGGACATGTT 


GGGGGACGGG 


GATTCCCCGG 


GTCCGGGATT 
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2601 TACCCCCCAC GACTCCGCCC CCTACGGCGC TCTGGATATG GCCGACTTCG 
2651 AGTTTGAGCA GATGTTTACC GATGCCCTTG GAATTGACGA GTACGGTGGG 
2701 GATATCCAGC ACAGTGGCGG CCGCGACGCC GCTGTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GATGCAGAAC TAG 
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(SEQ ID NO: 26) 

Amino acid sequence of APP(1-651)NFEV, K612V, GAL4-VP16(delMl) APP (664-695) 
ml pgl all 11 aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 
evypel qi tnvveanqpvt i qnwckrgr kqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 
whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 



edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyletpgdenehahfqkakerleakhrermsqvmreweeaerqaknlpkadkkaviqhfqekvesleqe 

aanerqql vethmarveaml ndrrri al enyi tal qavpprprhvf nml kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si lynvpavaeei qdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl tt rpgsgl tni kteei sev 
2 3 4 5 

nfeyef rhdsgyevhhqyl vf f aedvgsnkgai i glmvggvviatyi vi tl vml kk kkl gtel gstspvwwns 

^i -kl 1 ssi eqacdi crl kkl kcskekpkcakcl knnwecryspktkrspl t rahl tevesrl erl eql f 1 1 i f predl d 

6 

mi 1 kinds! qdi kal 1 tgl f vqdnvnkdavtdrl asvetdmpl tl rqhri satssseessnkgqral tvsgi pgdl app 
tdvsl gdel hi dgedvamahadal ddf dl dm! gdgdspgpgf tphdsapygal dmadf ef eqmf tdal qi dev 
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(SEQ ID NO: 27 AND 28) 

1 6ACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC 

CTGCCTAGCC CTCTAGAGGG CTAGGGGATA CCAGCTGAGA GTCATGTTAG 
51 TGCTCTGATG CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT 

ACGAGACTAC GGCGTATCAA TTCGGTCATA GACGAGGGAC GAACACACAA 
101 GGAGGTCGCT GAGTAGTGCG CGAGCAAAAT TTAAGCTACA ACAAGGCAAG 

CCTCCAGCGA CTCATCACGC GCTCGTTTTA AATTCGATGT TGTTCCGTTC 
151 GCTTGACCGA CAATTGCATG AAGAATCTGC TTAGGGTTAG GCGTTTTGCG 

CGAACTGGCT GTTAACGTAC TTCTTAGACG AATCCCAATC CGCAAAACGC 
201 CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT GATTATTGAC 

GACGAAGCGC TACATGCCCG GTCTATATGC GCAACTGTM CTAATAACTG 
251 TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 

ATCAATAATT ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT 
301 TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG 

ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC 
351 CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 

GGGTTGCTGG GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA 
401 AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAC TATTTACGGT 

TTGCGGTTAT CCCTGAAAGG TAACTGCAGT TACCCACCTG ATAAATGCCA 
451 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 

TTTGACGGGT GAACCGTCAT GTAGTTCACA TAGTATACGG TTCATGCGGG 
501 CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA 

GGATAACTGC AGTTACTGCC ATTTACCGGG CGGACCGTAA TACGGGTCAT 
551 CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 

GTACTGGAAT ACCCTGAAAG GATGAACCGT CATGTAGATG CATAATCAGT 
601 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA 

AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT ACCCGCACCT 
651 TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 

ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 
701 TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 

ACCCTCAAAC AAAACCGTGG "mTAGTTGC CCTGAAAGGT TTTACAGCAT 
751 ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 

TGTTGAGGCG GGGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 
801 GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA CTGCTTACTG 

CAGATATATT CGTCTCGAGA GACCGATTGA TCTCTTGGGT GACGAATGAC 
851 GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTGGCTAGC 

CGAATAGCTT TAATTATGCT GAGTGATATC CCTCTGGGTT CGACCGATCG 
901 GTTTAAACTT AAGCTTGGTA CCGAGCTCGG ATCCACTAGT CCAGTGTGGT 

CAAATTTGAA TTCGAACCAT GGCTCGAGCC TAGGTGATCA GGTCACACCA 
951 GGAATTCTGC AGATATCCAG CACAGTGGCG GCCGCTCGAG TCTAGAGGGC 

CCTTAAGACG TCTATAGGTC GTGTCACCGC CGGCGAGCTC AGATCTCCCG 
1001 CCGTTTAAAC CCGCTGATCA GCCTCGACTG TGCCTTCTAG TTGCCAGCCA 

GGCAAATTTG GGCGACTAGT CGGAGCTGAC ACGGAAGATC AACGGTCGGT 
1051 TCTGTTGTTT GCCCCTCCCC CGTGCCTTCC TTGACCCTGG AAGGTGCCAC 

AGACAACAAA CGGGGAGGGG GCACGGAAGG AACTGGGACC TTCCACGGTG 
1101 TCCCACTGTC CTTTCCTAAT AAAATGAGGA AATTGCATCG CATTGTCTGA 

AGGGTGACAG GAAAGGATTA TTTTACTCCT TTAACGTAGC GTAACAGACT 
1151 GTAGGTGTCA TTCTATTCTG GGGGGTGGGG TGGGGCAGGA CAGCAAGGGG 

CATCCACAGT AAGATAAGAC CCCCCACCCC ACCCCGTCCT GTCGTTCCCC 
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1201 GAGGATTGGG AAGACAATAG CAGGCATGCT GGGGATGCGG TGGGCTCTAT 

CTCCTAACCC TTCTGTTATC GTCCGTACGA CCCCTACGCC ACCCGAGATA 
1251 GGCTTCTGAG GCGGAAAGAA CCAGCTGGGG CTCTAGGGGG TATCCCCACG 

CCGAAGACTC CGCCTTTCTT GGTCGACCCC GAGATCCCCC ATAGGGGTGC 
1301 CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC 

GCGGGACATC GCCGCGTAAT TCGCGCCGCC CACACCACCA ATGCGCGTCG 
1351 GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT 

CACTGGCGAT GTGAACGGTC GCGGGATCGC GGGCGAGGAA AGCGAAAGAA 
1401 CCCTTCCTTT CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC 

GGGAAGGAAA GAGCGGTGCA AGCGGCCGAA AGGGGCAGTT CGAGATTTAG 
1451 GGGGCATCCC TTTAGGGTTC CGATTTAGTG CTTTACGGCA CCTCGACCCC 

CCCCGTAGGG AAATCCCAAG GCTAAATCAC GAAATGCCGT GGAGCTGGGG 
1501 AAAAAA CTTG ATTAGGGTGA TGGTTCACGT AGTGGGCCAT CGCCCTGATA 

1TTTTTGAAC TAATCCCACT ACCAAGTGCA TCACCCGGTA GCGGGACTAT 
1551 GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT AATAGTGGAC 

CTGCCAAAAA GCGGGAAACT GCAACCTCAG GTGCAAGAAA TTATCACCTG 
1601 TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGT CTATTCTTTT 

AGAACAAGGT TTGACCTTGT TGTGAGTTGG GATAGAGCCA GATAAGAAAA 
1651 GATTTATAAG GGATTTTGGG GATTTCGGCC TATTGGTTAA AAAATGAGCT 

CTAAATATTC CCTAAAACCC CTAAAGCCGG ATAACCAATT TTTTACTCGA 
1701 GATTTAACAA AAATTTAACG CGAATTAATT CTGTGGAATG TGTGTCAGTT 

CTAAATTGTT TTTAAATTGC GCTTAATTAA GACACCTTAC ACACAGTCAA 
1751 AGGGTGTGGA AAGTCCCCAG GCTCCCCAGG CAGGCAGAAG TATGCAAAGC 

TCCCACACCT TTCAGGGGTC CGAGGGGTCC GTCCGTCTTC ATACGTTTCG 
1801 ATGCATCTCA ATTAGTCAGC AACCAGGTGT GGAAAGTCCC CAGGCTCCCC 

TACGTAGAGT TAATCAGTCG TTGGTCCACA CCTTTCAGGG GTCCGAGGGG 
1851 AGCAGGCAGA AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCATAG 

TCGTCCGTCT TCATACGTTT CGTACGTAGA -GTTAATCAGT CGTTGGTATC 
1901 TCCCGCCCCT AACTCCGCCC ATCCCGCCCC TAACTCCGCC CAGTTCCGCC 

AGGGCGGGGA TTGAGGCGGG TAGGGCGGGG ATTGAGGCGG GTCAAGGCGG 
1951 CATTCTCCGC CCCATGGCTG ACTAATTTTT TTTATTTATG CAGAGGCCGA 

GTAAGAGGCG GGGTACCGAC TGATTAAAAA AAATAAATAC GTCTCCGGCT 
2001 GGCCGCCTCT GCCTCTGAGC TATTCCAGAA GTAGTGAGGA GGCI 1 1 1 1 IG 

CCGGCGGAGA CGGAGACTCG ATAAGGTCTT CATCACTCCT CCGAAAAAAC 
2051 GAGGCCTAGG CTTTTGCAAA AAGCTCCCGG GAGCTTGTAT ATCCATTTTC 

CTCCGGATCC GAAAACGTTT TTCGAGGGCC CTCGAACATA TAGGTAAAAG 
2101 GGATCTGATC AGCACGTGTT GACAATTAAT CATCGGCATA GTATATCGGC 

CCTAGACTAG TCGTGCACAA CTGTTAATTA GTAGCCGTAT CATATAGCCG 
2151 ATAGTATAAT ACGACAAGGT GAGGAACTAA ACCATGGCCA AGTTGACCAG 

TATCATATTA TGCTGTTCCA CTCCTTGATT TGGTACCGGT TCAACTGGTC 
2201 TGCCGTTCCG GTGCTCACCG CGCGCGACGT CGCCGGAGCG GTCGAGTTCT 

ACGGCAAGGC CACGAGTGGC GCGCGCTGCA GCGGCCTCGC CAGCTCAAGA 
2251 GGACCGACCG GCTCGGGTTC TCCCGGGACT TCGTGGAGGA CGACTTCGCC 

CCTGGCTGGC CGAGCCCAAG AGGGCCCTGA AGCACCTCCT GCTGAAGCGG 
2301 GGTGTGGTCC GGGACGACGT GACCCTGTTC ATCAGCGCGG TCCAGGACCA • 

CCACACCAGG CCCTGCTGCA CTGGGACAAG TAGTCGCGCC AGGTCCTGGT 
2351 GGTGGTGCCG GACAACACCC TGGCCTGGGT GTGGGTGCGC GGCCTGGACG 

CCACCACGGC CTGTTGTGGG ACCGGACCCA CACCCACGCG CCGGACCTGC 
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2401 AGCTGTACGC CGAGTGGTCG GAGGTCGTGT CCACGAACTT CCGGGACGCC 

TCGACATGCG GCTCACCAGC CTCCAGCACA GGTGCTTGAA GGCCCTGCGG 
2451 TCCGGGCCGG CCATGACCGA GATCGGCGAG CAGCCGTGGG GGCGGGAGTT 

AGGCCCGGCC GGTACTGGCT CTAGCCGCTC GTCGGCACCC CCGCCCTCAA 
2501 CGCCCTGCGC GACCCGGCCG GCAACTGCGT GCACTTCGTG GCCGAGGAGC 

GCGGGACGCG CTGGGCCGGC CGTTGACGCA CGTGAAGCAC CGGCTCCTCG 
2551 AGGACTGACA CGTGCTACGA GATTTCGATT CCACCGCCGC CTTCTATGAA 

TCCTGACTGT GCACGATGCT CTAAAGCTAA GGTGGCGGCG GAAGATACTT 
2601 AGGTTGGGCT TCGGMTCGT TTTCCGGGAC GCCGGCTGGA TGATCCTCCA 

TCCAACCCGA AGCCTTAGCA AAAGGCCCTG CGGCCGACCT ACTAGGAGGT 
2651 GCGCGGGGAT CTCATGCTGG AGTTCTTCGC CCACCCCAAC TTGTTTATTG 

CGCGCCCCTA GAGTACGACC TCAAGAAGCG GGTGGGGTTG AACAAATAAC 
2701 CAGCTTATAA TGGTTACAAA TAAAGCAATA GCATCACAAA TTTCACAAAT 

GTCGAATATT ACCAATGTTT ATTTCGTTAT CGTAGTGTTT AAAGTGTTTA 
2751 AAAGCATTTT TTTCACTGCA TTCTAGTTGT GGTTTGTCCA AACTCATCAA 

TTTCGTAAAA AAAGTGACGT AAGATCAACA CCAAACAGGT TTGAGTAGTT 
2801 TGTATCTTAT CATGTCTGTA TACCGTCGAC CTCTAGCTAG AGCTTGGCGT 

ACATAGAATA GTACAGACAT ATGGCAGCTG GAGATCGATC TCGAACCGCA 
2851 AATCATGGTC ATAGCTGTTT CCTGTGTGAA ATTGTTATCC GCTCACAATT 

TTAGTACCAG TATCGACAAA GGACACACTT TAACAATAGG CGAGTGTTAA 
2901 CCACACAACA TACGAGCCGG AAGCATAAAG TGTAAAGCCT GGGGTGCCTA 

GGTGTGTTGT ATGCTCGGCC TTCGTATTTC ACATTTCGGA CCCCACGGAT 
2951 ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC 

TACTCACTCG ATTGAGTGTA ATTAACGCAA CGCGAGTGAC GGGCGAAAGG 
3001 AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG 

TCAGCCCTTT GGACAGCACG GTCGACGTAA TTACTTAGCC GGTTGCGCGC 
3051 GGGAGAGGCG GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA 

CCCTCTCCGC CAAACGCATA ACCCGCGAGA AGGCGAAGGA GCGAGTGACT 
3101 CTCGCTGCGC TCGGTCGTTC GGCTGCGGCG AGCGGTATCA GCTCACTCAA 

GAGCGACGCG AGCCAGCAAG CCGACGCCGC TCGCCATAGT CGAGTGAGTT 
3151 AGGCGGTAAT ACGGTTATCC ACAGAATCAG GGGATAACGC AGGAAAGAAC 

TCCGCCATTA TGCCAATAGG TGTCTTAGTC CCCTATTGCG TCCTTTCTTG 
3201 ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA AGGCCGCGTT 

TACACTCGTT TTCCGGTCGT "ITTCCGGTCC TTGGCATTTT TCCGGCGCAA 
3251 GCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC 

CGACCGCAAA AAGGTATCCG AGGCGGGGGG ACTGCTCGTA GTGTTTTTAG 
3301 GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG 

CTGCGAGTTC AGTCTCCACC GCTTTGGGCT GTCCTGATAT TTCTATGGTC 
3351 GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC 

CGCAAAGGGG GACCTTCGAG GGAGCACGCG AGAGGACAAG GCTGGGACGG 
3401 GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT 

CGAATGGCCT ATGGACAGGC GGAAAGAGGG AAGCCCTTCG CACCGCGAAA 
3451 CTCAATGCTC ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC 

GAGTTACGAG TGCGACATCC ATAGAGTCAA GCCACATCCA GCAAGCGAGG 
3501 AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC GCTGCGCCTT 

TTCGACCCGA CACACGTGCT TGGGGGGCAA GTCGGGCTGG CGACGCGGAA 
3551 ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC 

TAGGCCATTG ATAGCAGAAC TCAGGTTGGG CCATTCTGTG CTGAATAGCG 
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3601 CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC 

GTGACCGTCG TCGGTGACCA TTGTCCTAAT CGTCTCGCTC CATACATCCG 
3651 GGTGCTACAG AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG 

CCACGATGTC TCAAGAACTT CACCACCGGA TTGATGCCGA TGTGATCTTC 
3701 GACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC TTCGGAAAAA 

CTGTCATAAA CCATAGACGC GAGACGACTT CGGTCAATGG AAGCCTTTTT 
3751 GAGTTGGTAG CTCTTGATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT 

CTCAACC ATC GAGAACTAGG CCGTTTGTTT GGTGGCGACC ATCGCCACCA 
3801 1 1 1 1 1 1 GTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG GATCTCAAGA 

AAAAAACAAA CGTTCGTCGT CTAATGCGCG TCIIIIIIIC CTAGAGTTCT 
3851 AGATCCTTTG ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT 

TCTAGGAAAC TAGAAAAGAT GCCCCAGACT GCGAGTCACC TTGCTTTTGA 
3901 CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG 

GTGCAATTCC CTAAAACCAG TACTCTAATA GTTTTTCCTA GAAGTGGATC 
3951 ATCCTTTTAA AT TAAAAAT G AAGTTTTAAA TCAATCTAAA GTATATATGA 

TAGGAAAATT TAATTTTTAC TTCAAAATTT AGTTAGATTT CATATATACT 
4001 GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG GCACCTATCT 

CATTTGAACC AGACTGTCAA TGGTTACGAA TTAGTCACTC CGTGGATAGA 
4051 CAGCGATCTG TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG 

GTCGCTAGAC AGATAAAGCA AGTAGGTATC AACGGACTGA GGGGCAGCAC 
4101 TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA GTGCTGCAAT 

ATCTATTGAT GCTATGCCCT CCCGAATGGT AGACCGGGGT CACGACGTTA 
4151 GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC 

CTATGGCGCT CTGGGTGCGA GTGGCCGAGG TCTAAATAGT CGTTATTTGG 
4201 AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC 

TCGGTCGGCC TTCCCGGCTC GCGTCTTCAC CAGGACGTTG AAATAGGCGG 
4251 TCCATCCAGT CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC 

AGGTAGGTCA GATAATTAAC AACGGCCCTT CGATCTCATT CATCAAGCGG 
4301 AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTACAGGC ATCGTGGTGT 

TCAATTATCA AACGCGTTGC AACAACGGTA ACGATGTCCG TAGCACCACA 
4351 CACGCTCGTC GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA 

GTGCGAGCAG CAAACCATAC CGAAGTAAGT CGAGGCCAAG GGTTGCTAGT 
4401 AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG TTAGCTCCTT 

TCCGCTCAAT GTACTAGGGG GTACAACACG TTTTTTCGCC AATCGAGGAA 
4451 CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA 

GCCAGGAGGC TAGCAACAGT CTTCATTCAA CCGGCGTCAC AATAGTGAGT 
4501 TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA 

ACCAATACCG TCGTGACGTA TTAAGAGAAT GACAGTACGG TAGGCATTCT 
4551 TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG 

ACGAAAAGAC ACTGACCACT CATGAGTTGG TTCAGTAAGA CTCTTATCAC 
4601 TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAATACGG GATAATACCG 

ATACGCCGCT GGCTCAACGA GAACGGGCCG CAGTTATGCC CTATTATGGC 
4651 CGCCACATAG CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG 

GCGGTGTATC GTCTTGAAAT TTTCACGAGT AGTAACCTTT TGCAAGAAGC 
4701 GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA GTTCGATGTA 

CCCGCTTTTG AGAGTTCCTA GAATGGCGAC AACTCTAGGT CAAGCTACAT 
4751 ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG 

TGGGTGAGCA CGTGGGTTGA CTAGAAGTCG TAGAAAATGA AAGTGGTCGC 
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4801 TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA 

AAAGACCCAC TCGTTTTTGT CCTTCCGTTT TACGGCGTTT TTTCCCTTAT 
4851 AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA 

TCCCGCTGTG CCTTTACAAC TTATGAGTAT GAGAAGGAAA AAGTTATAAT 
4901 TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT 

AACTTCGTAA ATAGTCCCAA TAACAGAGTA CTCGCCTATG TATAAACTTA 
4951 GTATTTAGAA AAATAAACAA ATAGGGGTTC CGCGCACATT TCCCCGAAAA 

CATAAATCTT TTTATTTGTT TATCCCCAAG GCGCGTGTAA AGGGGCTTTT 
5001 GTGCCACCTG ACGTC 

CACGGTGGAC TGCAG 
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DNA sequence of APP( 1-651 )NFEV. TATexonl(ML) APP (664-695) 
(SEQ ID NO: 29) 



1 

1 


ATGCTGCCCG 


Oil! PPP A /-»-r- 

GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGGTA 


P P P A PTP A "T"P 

CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGTGGCAGA 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


A POP AT A PTP 

AGGCATCCTG 


P A PT" A ' ■ 1 PPP 

CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


A TPTPPT in* 

ATGTGGTAGA 


A PPP A A PP A A 

AGCCAACCAA 


PP A /vr*/^ A A 

CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


A PTPP A A P A P 

AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


351 


PTT A PIT PPT" 

CTTAGTTGGT 


P A P"l I 1 ATA A 

GAGTTTATAA 


PTP A TPP P m~ 

GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


A ATTPTTAPA 

AATTCTTACA 


PP APP AP APP 

CCAGGAGAGG 


A TPP A TP'I 1 T 

ATGGATGTTT 


PPP AAA A 

GCGAAACTCA 


TCTTCACTGG 


/I ci 


papa pr*PTr*r* 
LALALLb 1 Lb 


PP A A APAPAP 

CCAAAGAGAC 


A TP P A PTP A P 

ATGCAGTGAG 


A A P A PT A PP A 

AAGAGTACCA 


ACTTGCATGA 


cm 
b01 


PTA PPOP ATP 

LlALbbLAlb 


TTPPTPPPPT 


r^r^r^r* a attpa 

GCGGAATTGA 


P A A PTTPPP A 

CAAGTTCCGA 


PPPPTA P A AT 

GGGGTAGAGT 


CC1 

bbl 


TTPTPTPTTP 

1 ibtblbl lb 


PPP A PTPPPT 

LLLAC I bGL T 


PA A P A A A PTP 

GAAGAAAGTG 


A P A A TPTPP A 

ACAATGTGGA 


TTPTPPTP A T 

TTCTGCTGAT 


cm 
oUl 


blbbAbbAbb 


a tp a rTmn A 
A 1 GAL 1 LbGA 


TPTPTPPTPP 

IblCTGGTGG 


(^crr^r 1 a pp a p 

GGCGGAGCAG 


A P A P A P A PT A 

ACACAGACTA 


CC1 

bbl 


tpp A P A TPPP 

1 bt AbA 1 bbb 


APTPA AP AP A 

AbTGAAGACA 


A APTAPTAPA 

AAGTAGTAGA 


A PT A PP A P A P 

AGTAGCAGAG 


PAPPA APA AP 

GAGGAAGAAG 


"7m 

701 


TPPPTPAPPT 

TGGCTGAGGT 


PP AAPAAPAA 

GGAAGAAGAA 


PA A P PPP A TP 

GAAGCCGATG 


A TP A PP A PP A 

ATGACGAGGA 


CGATGAGGAT 


751 


PPTP A TP A PP 

GGTGATGAGG 


T A P A PP A A P A 

TAGAGGAAGA 


/-\/-*PTP a PP A A 

GGCTGAGGAA 


CCCTACGAAG 


A A /\ An A A y-\ A 

AAGCCACAGA 


om 
oUl 


PAPA APP APP 

bAbAALCALL 


A PP ATTPPP A 

AGCATTGCCA 


PP A PP A PP A P 

CCACCACCAC 


P A PP A P^> A /"* A 

CACCACCACA 


GAGTCTGTGG 


obi 


A A P A PPTPPT 

AAbAbb 1 bbl 


TPP A PTTPPT 

TCGAGTTCCT 


A P A A P A PP A P 

ACAACA6CAG 


PP A PTA OPP P 

CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 


1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 
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1351 ATCCGGTCCC AGGTTATGAC ACACCTCCGT GTGATTTATG AGCGCATGAA 
1401 TCAGTCTCTC TCCCTGCTCT ACAACGTGCC TGCAGTGGCC GAGGAGATTC 
1451 AGGATGAAGT TGATGAGCTG CTTCAGAAAG AGCAAAACTA TTCAGATGAC 
1501 GTCTTGGCCA ACATGATTAG TGAACCAAGG ATCAGTTACG GAAACGATGC 
1551 TCTCATGCCA TCTTTGACCG AAACGAAAAC CACCGTGGAG CTCCTTCCCG 
1601 TGAATGGAGA GTTCAGCCTG GACGATCTCC AGCCGTGGCA TTCTTTTGGG 
.1651 GCTGACTCTG TGCCAGCCAA CACAGAAAAC GAAGTTGAGC CTGTTGATGC 
1701 CCGCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGGGTTGA 
1751 CAAATATCAA GACGGAGGAG ATCTCTGAAG TGAACTTTGA AGTGGAATTC 
1801 CGACATGACT CAGGATATGA AGTTCATCAT CAAAAATTGG TGTTCTTTGC 
1851 AGAAGATGTG GGTTCAAACA AAGGTGCAAT CATTGGACTC ATGGTGGGCG 
1901 GTGTTGTCAT AGCGACAGTG ATCGTCATCA CCTTGGTGAT GCTGAAGAAG 
1951 AAAAAGCTTG GTACCGAGCT CGGATCCACT AGTCCAGTGT GGTGGAATTC 
2001 TGCAGATATC CTGGAGCCAG TAGATCCTAG ACTAGAGCCC TGGAAGCATC 
2051 CAGGAAGTCA GCCTAAAACT GCTTGTACCA ATTGCTATTG TAAAAAGTGT 
2101 TGCTTTCATT GCCAAGTTTG TTTCATGACA AAAGCCTTAG GCATCTCCTA 
2151 TGGCAGGAAG AAGCGGAGAC AGCGACGAAG AGCTCATCAG AACAGTCAGA 
2201 CTCATCAAGC TTCTCTATCA AAGCAGAGGA TATCCAGCAC AGTGGCGGCC 
2251 GCAGACGCCG CTGTCACCCC AGAGGAGCGC CACCTGTCCA AGATGCAGCA 
2301 GAACGGCTAC GAAAATCCAA CCTACAAGTT CTTTGAGCAG ATGCAGAACT 
2351 AG 
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(SEQ ID NO: 30) 

Amino acid sequence of APP(1-651)NFEV, K612V-TATexonI(MlL) APP (664-695) 

ml pgl aim aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 

evypel qi tnvveanqpvti qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyletpgdenehahfqkakerl eakhrermsqvmreweeaerqaknl pkadkkaviqhfqekvesleqe 

aanerqql vethmarveaml ndrrrl al enyi tal qavpprprhvf nml kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si lynvpavaeeiqdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl tt rpgsgl tni kteei sev 
2 3 45 

nfeyef rhdsgyevhhqkl vf f aedvgsnkgai i gl mvggvviatyi vi tl vnfl kk klcl gtcl gstspvwwns 

6 

ad il epvdprl epwkhpqsqpktactncyckkccf hcqvcf mtkal qi sygrkkr rqrrrahqnsqthqasl skq 

7 8 
ri sstvaaa daavtpeerhl skmqgngyenptykf f eqmqn 
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DNA sequence of APP( 1-651) NFEV. 6AL4VP16(delMet) APP (664-695) 



(SEQ 


ID NO: 31) 










1 


ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGGTA 


CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGTGGCAGA 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGTTTATAA 


GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 
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1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 


1351 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCCGT 


GTGATTTATG 


AGCGCATGAA 


1401 


TCAGTCTCTC 


TCCCTGCTCT 


ACAACGTGCC 


TGCAGTGGCC 


GAGGAGAHC 


1451 


AGGATGAAGT 


TGATGAGCTG 


CTTCAGAAAG 


AGCAAAACTA 


TTCAGATGAC 


1501 


GTCTTGGCCA 


ACATGATTAG 


TGAACCAAGG 


ATCAGTTACG 


GAAACGATGC 


1551 


TCTCATGCCA 


TCTTTGACCG 


AAACGAAAAC 


CACCGTGGAG 


CTCCTTCCCG 


1601 


TGAATGGAGA 


GTTCAGCCTG 


GACGATCTCC 


AGCCGTGGCA 


TTCI 1 1 IGGG 


1651 


GCTGACTCTG 


TGCCAGCCAA 


CACAGAAAAC 


GAAGTTGAGC 


CTGTTGATGC 


1701 


CCGCCCTGCT 


GCCGACCGAG 


GACTGACCAC 


TCGACCAGGT 


TCTGGGTTGA 


1751 


CAAATATCAA 


GACGGAGGAG 


ATCTCTGAAG 


TGAACTTTGA 


AGTGGAATTC 


1801 


CGACATGACT 


CAGGATATGA 


AGTTCATCAT 


CAAAAATTGG 


TGTTCTTTGC 


1851 


AGAAGATGTG 


GGTTCAAACA 


AAGGTGCAAT 


CATTGGACTC 


ATGGTGGGCG 


1901 


GTGTTGTCAT 


AGCGACAGTG 


ATCGTCATCA 


CCTTGGTGAT 


GCTGAAGAAG 


1951 


AAAAAGCTTG 


GTACCGAGCT 


CGGATCCACT 


AGTCCAGTGT 


GGTGGAATTC 


2001 


TGCAGATATC 


AAGCTACTGT 


CTTCTATCGA 


ACAAGCATGC 


GATATTTGCC 


2051 


GACTTAAAAA 


GCTCAAGTGC 


TCCAAAGAAA 


AACCGAAGTG 


CGCCAAGTGT 


2101 


CTGAAGAACA 


ACTGGGAGTG 


TCGCTACTCT 


CCCAAAACCA 


AAAGGTCTCC 


2151 


GCTGACTAGG 


GCACATCTGA 


CAGAAGTGGA 


ATCAAGGCTA 


GAAAGACTGG 


2201 


AACAGCTATT 


TCTACTGATT 


TTTCCTCGAG 


AAGACCTTGA 


CATGAI 1 1 IG 


2251 


AAAATGGATT 


CTTTACAGGA 


TATAAAAGCA 


TTGTTAACAG 


GATTATTTGT 


2301 


ACAAGATAAT 


GTGAATAAAG 


ATGCCGTCAC 


AGATAGATTG 


GCTTCAGTGG 


2351 


AGACTGATAT 


GCCTCTAACA 


TTGAGACAGC 


ATAGAATAAG 


TGCGACATCA 


2401 


TCATCGGAAG 


AGAGTAGTAA 


CAAAGGTCAA 


AGACAGTTGA 


CTGTATCGGG 


2451 


AATTCCCGGG 


GATCTGGCCC 


CCCCGACCGA 


TGTCAGCCTG 


GGGGACGAGC 


2501 


TCCACTTAGA 


CGGCGAGGAC 


GTGGCGATGG 


CGCATGCCGA 


CGCGCTAGAC 


2551 


GATTTCGATC 


TGGACATGTT 


GGGGGACGGG 


GATTCCCCGG 


GTCCGGGATT 
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2601 TACCCCCCAC GACTCCGCCC CCTACGGCGC TCTGGATATG GCCGACTTCG 
2651 AGTTTGAGCA GATGTTTACC GATGCCCTTG GAATTGACGA GTACGGTGGG 
2701 GATATCCAGC ACAGTGGCGG CCGCGACGCC GCTGTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GATGCAGAAC TAG 
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(SEQ ID NO: .32) 

Amino acid sequence of APPU-65DNFEV, GAL4-VP16(delMl) APP (664-695) 
ml pgl al 111 aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegi 1 cjycq 
evypel qi tnvveanqpvti qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 
whtvaketcsekstnl hdygml 1 pcgidkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 



edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkka vi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyi tal qa vpprprhvf nml kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si lynvpavaeeiqdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl tt rpgsgl tni kteei sev 
2 3 4 5 

nfeyef rhdsgyevhhqkl vf f aedvgsnkgai i gl mvg gvvi atv i vi 1 1 vml kk kkl gtel gstspvwwns 

ad tkl 1 ssi eqacdi crl kkl kcskekpkcakcl knnwecryspktkrspl trahl tevesrl erl eql f 1 1 i f predl d 

6 

mi 1 kmdsl qdi kail tgl f vqdnvnkdavtdrl asvetdmpl tl rahri satssseessnkgqrql tvsgi pgdl app 

tdvsl gdel hi dgedvamahadal ddf dl dml gdgdspgpgf tphdsap ygal dmadf ef eqmftdal gi de v 
7 8 



1 
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SEQUENCE LISTING 



<110> Merck & Co., Inc. 
Espeseth, Amy S. 
Ferrer , Marc 
Flores, Osvaldo A. 
Hazuda, Daria J. 
Inglese, James 
Miller, Michael D. 
Register, Bruce 
Shi , Xiao-Ping 
Simon, Adam J. 
Zuck, Paul D. 

<120> ASSAYS TO MONITOR AMYLOID PRECURSOR 
PROTEIN PROCESSING 
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<210> 1 
<211> 2352 
<212> DNA 

<213> fusion protein 



<400> 1 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 



human 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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tctgggttga caaatatcaa gacggaggag atctctgaag tgaatctaga tgcagaattc 1800 

cgacatgact caggatatga agttcatcat caagtattgg tgttctttgc agaagatgtg 1860 

ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 

atcgtcatca ccttggtgat gctgaagaag aaaaagcttg gtaccgagct cggatccact 1980 

agtccagtgt ggtggaattc tgcagatatc ctggagccag tagatcctag actagagccc 2040 

tggaagcatc caggaagtca gcctaaaact gcttgtacca attgctattg taaaaagtgt 2100 

tgctttcatt gccaagtttg tttcatgaca aaagccttag gcatctccta tggcaggaag 2160 

aagcggagac agcgacgaag agctcatcag aacagtcaga ctcatcaagc ttctctatca 2220 

aagcagagga tatccagcac agtggcggcc gcagacgccg ctgtcacccc agaggagcgc 2280 

cacctgtcca agatgcagca gaacggctac gaaaatccaa cctacaagtt ctttgagcag 2340 

atgcagaact ag 2352 

<210> 2 
<211> 783 
<212> PRT 

<213> fusion protein - human 



<400> 2 
























Met 


Leu 


Pro 


Gly 


Leu 


Ala 


Leu 


Leu 


Leu 


Leu Ala Ala 


Trp Thr 


Ala 


Arg 


1 








5 










10 




15 




Ala 


Leu 


Glu 


Val 


Pro 


Thr 


Asp 


Gly 


Asn 


Ala Gly Leu 


Leu Ala 


Glu 


Pro 








20 










25 


30 






Gin 


He 


Ala 


Met 


Phe 


Cys 


Gly 


Arg 


Leu 


Asn Met His 


Met Asn 


Val 


Gin 






35 










40 






45 






Asn 


Gly 


Lys 


Trp 


Asp 


Ser 


Asp 


Pro 


Ser 


Gly Thr Lys 


Thr Cys 


He 


Asp 




50 










55 






60 








Thr 


Lys 


Glu 


Gly 


He 


Leu 


Gin 


Tyr 


Cys 


Gin Glu Val 


Tyr Pro 


Glu 


Leu 


65 










70 








75 






80 


Gin 


He 


Thr 


Asn 


Val 


Val 


Glu 


Ala 


Asn 


Gin Pro Val 


Thr He 


Gin 


Asn 










85 










90 




95 




Trp 


Cys 


Lys 


Arg 


Gly 


Arg 


Lys 


Gin 


Cys 


Lys Thr His 


Pro His 


Phe 


Val 








100 










105 




110 






He 


Pro 


Tyr 


Arg 


Cys 


Leu 


Val 


Gly 


Glu 


Phe He Ser 


Asp Ala 


Leu 


Leu 






115 










120 






125 






Val 


Pro 


Asp 


Lys 


Cys 


Lys 


Phe 


Leu 


His 


Gin Glu Arg 


Met Asp 


Val 


Cys 




130 










135 






140 






Glu 


Thr 


His 


Leu 


His 


Trp 


His 


Thr 


Val 


Ala Lys Glu 


Thr Cys 


Ser 


Glu 


145 










150 








155 






160 


Lys 


Ser 


Thr 


Asn 


Leu 


His 


Asp 


Tyr 


Gly 


Met Leu Leu 


Pro Cys 


Gly 


He 










165 










170 




175 




Asp 


Lys 


Phe 


Arg 


Gly 


Val 


Glu 


Phe 


Val 


Cys Cys Pro 


Leu Ala 


Glu 


Glu 








180 










185 




190 






Ser 


Asp 


Asn 


Val 


Asp 


Ser 


Ala 


Asp 


Ala 


Glu Glu Asp 


Asp Ser 


Asp 


Val 






195 










200 






205 






Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thr 


Asp 


Tyr 


Ala Asp Gly 


Ser Glu 


Asp 


Lys 




210 










215 






220 








Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu 


Glu 


Glu 


Val Ala Glu 


Val Glu 


Glu 


Glu 


225 










230 








235 






240 


Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp 


Asp 


Glu 


Asp Gly Asp 


Glu Val 


Glu 


Glu 










245 










250 




255 




Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu 


Glu 


Ala 


Thr Glu Arg 


Thr Thr 


Ser 


He 








260 










265 




270 






Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Glu 


Ser Val Glu 


Glu Val 


Val 


Arg 






275 










280 






285 




Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser 


Thr 


Pro 


Asp Ala Val 


Asp Lys 


Tyr 


Leu 




290 










295 






300 






Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn 


Glu 


His 


Ala His Phe 


Gin Lys 


Ala 


Lys 


305 










310 








315 






320 


Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 


Glu 


Arg Met Ser 


Gin Val 


Met 


Arg 










325 










330 




335 


Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg 


Gin 


Ala 


Lys Asn Leu 


Pro Lys 


Ala 


Asp 








340 










345 




350 
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Lys 


Lys 


Ala Val 


±±e Laxn nis rTie 


Gin giu Lys vai 


pi,. 

GIU 


Ser 


Leu 


pi :n 

bill 






355 


■sen 




*a c c 








bin 


Glu 


Ala Ala 


Asn giu Arg bin 


pi- r aii TT=»1 Pi „ 

Gin Leu vai giu 


Thr 


HIS 


jxie u 


Ala 

Aia 




370 




J / D 


Ton 
joU 










Axg 


Val 


Glu Ala 


Met lieu Asn Asp 


Arg Arg Arg Leu 


Aia 


Leu 


ljJ.U 


Asn 


















Ann 


Tyr 


lie 


Thr Ala 


Leu Gin Aia vai 


Pro Pro Arg Pro 


Arg 


TT J — 
HIS 


vai 


irne 










411U 










Asn 


Met 


Leu Lys 


Lys Tyr Val Arg 


7V 1 fa Pi i< pi- T rn 

Ala giu Gin Lys 


Asp 


Arg 


Pi _ 

Gin 


HIS 






420 




/ O C 




a "in 
4 6 U 






TlHT 


Leu Lys His 


?ne giu his vai 


Arg Met vai Asp 


Pro 


Lys 


Lys 


Ala 






435 


A ACi 




A A TZ. 








Ala 


Gin lie Arg 


ber Gin vai Met 


Tnr His Leu Arg 


Val 


lie 


Tyr 


GlU 




450 




/i c ^ 


/ c n 
4oU 










Arg 


Met 


Asn Gin 


Ser Leu Ser Leu 


Leu Tyr Asn Val 


Pro 


Ala 


vai 


7V 1 -a 

Ala 








A 1 A 


4t ID 








ilOA 

4oU 


GIU 


Glu 


He Gin 


Asp Glu Val Asp 


Glu Leu Leu Gin 


Lys 


pi _, 
GlU 


pi 
Gin 


Asn 








/I Q C 


/ion 
4y u 






>l QC 




Tyx 


Ser Asp Asp 


Val Leu Ala Asn 


Met lie ber giu 


Pro 


Arg 


Tl ^ 

lie 


Ser 






500 




C AC 

d\jd 




CIA 

DlU 






Tyr 


Gly Asn Asp 


Ala Leu Met Pro 


Ser Leu Thr Glu 


Thr 


Lys 


Thr 


Thr 






515 


c *i a 




c o c 








vai 


Glu 


Leu Leu 


Pro vai Asn Gly 


Glu Phe Ser Leu 


Asp 


Asp 


Leu 


Gin 




530 
















Pro 


Trp 


His Ser 


pne Giy Ala Asp 


ber val Pro Ala 


Asn 


Thr 


Glu 


Asn 








CCA 


c c c 
ODD 








CCA 

DDL) 


Glu 


Val 


Glu Pro 


Val Asp Ala Arg 


Pro Ala Ala Asp 


Arg 


Gly 


Leu 


Thr 








etc 

boo 


O / U 






C7C 
0 ID 




Thr 


Arg 


Pro Gly 


Ser Gly Leu Thr 


Asn He Lys Thr 


Glu 


Glu 


Tl 

He 


Ser 






580 




CQC 




con 






p i „, 

GlU 


Val 


Asn Leu 


Asp Ala Glu Pne 


Arg His Asp Ser 


Gly 


Tyr 


Glu 


TT— 1 

Val 






595 


en a 
bUU 




enc 
oUo 








TT J c 
HIS 


His 


Gin Val 


Leu vai pne pne 


Ala giu Asp vai 


PI,, 

Gly 


Ser 


Asn 


Lys 




610 




CI C 

olo 


bzU 








Gly 


Ala 


He He 


Gly Leu Met val 


p T _ _ p "1 , , TT_ 1 TT-l 

Gly Gly Val val 


Tl ^ 

lie 


Ala 


Thr 


Val 








C*3 A 


OJ J 








C A A 


lie 


Val 


He Thr 


Leu val Met Leu 


Lys Lys Lys Lys 


Leu 


Cly 


Thr 


Glu 










650 






fee 




Leu 


Gly Ser Thr 


Ser Pro Val Trp 


Trp Asn Ser Ala 


Asp 


Tl ~ 

lie 


Leu 


Glu 






660 




boo 


b /u 






Pro 


Val 


Asp Pro 


Arg Leu giu rro 


rn>^* t ^ r « tt — « --w 

irp iiys his irro 


Gly 


Ser 


pi — 
Gin 


Pro 






675 


680 




685 








Lys 


Thr Ala Cys 


Thr Asn Cys Tyr 


Cys Lys Lys Cys 


Cys 


Phe 


His 


Cys 




690 




695 


700 










Gin 


Val 


Cys Phe 


Met Thr Lys Ala 


Leu Gly He Ser 


Tyr 


Gly 


Arg 


Lys 


705 






710 


715 








720 


Lys 


Arg Arg Gin 


Arg Arg Arg Ala 


His Gin Asn Ser 


Gin 


Thr 


His 


Gin 








725 


730 






735 




Ala 


Ser 


Leu Ser 


Lys Gin Arg He 


Ser Ser Thr Val 


Ala 


Ala 


Ala 


Asp 






740 




745 




750 






Ala 


Ala Val Thr 


Pro Glu Glu Arg 


His Leu Ser Lys 


Met 


Gin 


Gin 


Asn 






755 


760 


765 








Gly 


Tyr 


Glu Asn 


Pro Thr Tyr Lys. 


Phe Phe Glu Gin 


Met 


Gin 


Asn 






770 




775 


780 











<210> 3 
<211> 2352 
<212> DNA 

<213> fusion protein - human 
<400> 3 
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atgctgcccg gtttggcact gctcctgctg 
cccactgatg gtaatgctgg cctgctggct 
ctgaacatgc acatgaatgt ccagaatggg 
acctgcattg ataccaagga aggcatcctg 
cagatcacca atgtggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccatccccac 
gagtttataa gtgatgccct tctcgttcct 
atggatgttt gcgaaactca tcttcactgg 
aagagtacca acttgcatga ctacggcatg 
ggggtagagt ttgtgtgttg cccactggct 
gcggaggagg atgactcgga tgtctggtgg 
agtgaagaca aagtagtaga agtagcagag 
gaagccgatg atgacgagga cgatgaggat 
ccctacgaag aagccacaga gagaaccacc 
gagtctgtgg aagaggtggt tcgagttcct 
gacaagtatc tcgagacacc tggggatgag 
gagaggcttg aggccaagca ccgagagaga 
gcagaacgtc aagcaaagaa cttgcctaaa 
caggagaaag tggaatcttt ggaacaggaa 
acacacatgg ccagagtgga agccatgctc 
tacatcaccg ctctgcaggc tgttcctcct 
aagtatgtcc gcgcagaaca gaaggacaga 
cgcatggtgg atcccaagaa agccgctcag 
gtgatttatg agcgcatgaa tcagtctctc 
gaggagattc aggatgaagt tgatgagctg 
gtcttggcca acatgattag tgaaccaagg 
tctttgaccg aaacgaaaac caccgtggag 
gacgatctcc agccgtggca ttcttttggg 
gaagttgagc ctgttgatgc ccgccctgct 
tctgggttga caaatatcaa gacggaggag 
cgacatgact caggatatga agttcatcat 
ggttcaaaca aaggtgcaat cattggactc 
atcgtcatca ccttggtgat gctgaagaag 
agtccagtgt ggtggaattc tgcagatatc 
tggaagcatc caggaagtca gcctaaaact 
tgctttcatt gccaagtttg tttcatgaca 
aagcggagac agcgacgaag agctcatcag 
aagcagagga tatccagcac agtggcggcc 
cacctgtcca agatgcagca gaacggctac 
atgcagaact ag 



gccgcctgga cggctcgggc gctggaggta 60 
gaaccccaga ttgccatgtt ctgtggcaga 120 
aagtgggatt cagatccatc agggaccaaa 180 
cagtattgcc aagaagtcta ccctgaactg 240 
ccagtgacca tccagaactg gtgcaagcgg 300 
tttgtgattc cctaccgctg cttagttggt 360 
gacaagtgca aattcttaca ccaggagagg 420 
cacaccgtcg ccaaagagac atgcagtgag 480 
ttgctgccct gcggaattga caagttccga 540 
gaagaaagtg acaatgtgga ttctgctgat 600 
ggcggagcag acacagacta tgcagatggg 660 
gaggaagaag tggctgaggt ggaagaagaa 720 
ggtgatgagg tagaggaaga ggctgaggaa 780 
agcattgcca ccaccaccac caccaccaca 840 
acaacagcag ccagtacccc tgatgccgtt 900 
aatgaacatg cccatttcca gaaagccaaa 960 
atgtcccagg tcatgagaga atgggaagag 1020 
gctgataaga aggcagttat ccagcatttc 1080 
gcagccaacg agagacagca gctggtggag 1140 
aatgaccgcc gccgcctggc cctggagaac 12 00 
cggcctcgtc acgtgttcaa tatgctaaag 1260 
cagcacaccc taaagcattt cgagcatgtg 1320 
atccggtccc aggttatgac acacctccgt 1380 
tccctgctct acaacgtgcc tgcagtggcc 1440 
cttcagaaag agcaaaacta ttcagatgac 1500 
atcagttacg gaaacgatgc tctcatgcca 1560 
ctccttcccg tgaatggaga gttcagcctg 1620 
gctgactctg tgccagccaa cacagaaaac 1680 
gccgaccgag gactgaccac tcgaccaggt 1740 
atctctgaag tgaagatgga tgcagaattc 1800 
caagtattgg tgttctttgc agaagatgtg 1860 
atggtgggcg gtgttgtcat agcgacagtg 1920 
aaaaagcttg gtaccgagct cggatccact 1980 
ctggagccag tagatcctag actagagccc 2040 
gcttgtacca attgctattg taaaaagtgt 2100 
aaagccttag gcatctccta tggcaggaag 2160 
aacagtcaga ctcatcaagc ttctctatca 2220 
gcagacgccg ctgtcacccc agaggagcgc 2280 
gaaaatccaa cctacaagtt ctttgagcag 2340 

2352 
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Met 


Leu 


Pro 


Gly 


Leu 


Ala 


Leu 


Leu 


Leu 


Leu 


Ala 


Ala 


Trp 


Thr 


Ala 


Arg 


1 








5 










10 










15 


Ala 


Leu 


Glu 


Val 


Pro 


Thr 


Asp 


Gly 


Asn 


Ala 


Gly Leu 


Leu 


Ala 


Glu 


Pro 








20 










25 










30 






Gin 


He 


Ala 


Met 


Phe 


Cys 


Gly 


Arg 


Leu 


Asn 


Met 


His 


Met 


Asn 


Val 


Gin 






35 










40 










45 








Asn 


Gly 


Lys 


Trp 


Asp 


Ser 


Asp 


Pro 


Ser 


Gly 


Thr 


Lys 


Thr 


Cys 


He 


Asp 




50 










55 










60 










Thr 


Lys 


Glu 


Gly 


He 


Leu 


Gin 


Tyr 


Cys 


Gin 


Glu 


Val 


Tyr 


Pro 


Glu 


Leu 


65 










70 










75 










80 


Gin 


He 


Thr 


Asn 


Val 


Val 


Glu 


Ala 


Asn 


Gin 


Pro 


Val 


Thr 


He 


Gin 


Asn 










85 










90 










95 




Trp 


Cys 


Lys 


Arg 


Gly 


Arg 


Lys 


Gin 


Cys 


Lys 


Thr 


His 


Pro 


His 


Phe 


Val 








100 










105 










110 






He 


Pro 


Tyr 


Arg 


Cys 


Leu 


Val 


Gly 


Glu 


Phe 


He 


Ser 


Asp 


Ala 


Leu 


Leu 
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115 



Val 


Pro 


Asp 


Lys 




130 






Glu 


Thr 


His 


Leu 


145 








Lys 


Ser 


Thr 


Asn 


Asp 


Lys 


Phe 


Arg 








180 


Ser 


Asp 


Asn 


Val 






195 




Trp 


Trp 


Gly 


Gly 




210 






Val 


Val 


Glu 


Val 


225 








Glu 


Ala 


Asp 


Asp 


Glu 


Ala 


Glu 


Glu 








260 


Ala 


Thr 


Thr 


Thr 






275 




Val 


Pro 


Thr 


Thr 




290 






Glu 


Thr 


Pro 


Gly 


305 








Glu 


Arg 


Leu 


Glu 


Glu 


Trp 


Glu 


Glu 








340 


Lys 


Lys 


Ala 


Val 






355 




Gin 


Glu 


Ala 


Ala 




370 






Arg 


Val 


Glu 


Ala 


385 








Tyr 


lie 


Thr 


Ala 


Asn 


Met 


Leu 


Lys 








420 


Thr 


Leu 


Lys 


His 






435 




Ala 


Gin 


He 


Arg 




450 






Arg 


Met 


Asn 


Gin 


465 








Glu 


Glu 


He 


Gin 


Tyr 


Ser 


Asp 


Asp 








500 


Tyr 


Gly 


Asn 


Asp 






515 




Val 


Glu 


Leu 


Leu 




530 






Pro 


Trp 


His 


Ser 


545 








Glu 


Val 


Glu 


Pro 


Thr 


Arg 


Pro 


Gly 








580 


Glu 


Val 


Lys 


Met 






595 




His 


His 


Gin 


Val 









1/ U 


Cys 


Lys 


irne 


Leu 






1 ^ 
1j D 




His 


m 

Trp 


ISA 

HIS 


Tnr 




150 






Leu 


His 


Asp 


Tyr 


165 








Gly 


Val 


Glu 


Phe 


Asp 


Ser 


TV -1 

Ala 


Asp 








o r\ n 


Ala 


Asp 


Thr 


ASp 






Z±D 




Ala 


Glu 


Glu 


Glu 




230 






Asp 


Glu 


Asp 


Asp 


245 








Pro 


Tyr 


Glu 


Glu 


Thr 


Thr 


Thr 


Thr 








280 


Ala 


Ala 


Ser 


Thr 






295 




Asp 


Glu 


Asn 


Glu 




310 






Ala 


Lys 


His 


Arg 


325 








Ala 


Glu 


Arg 


Gin 


He 


Gin 


His 


Pne 








360 


Asn 


Glu 


Arg 


Gin 






375 




Met 


Leu 


Asn 


Asp 




390 






Leu 


Gin 


Ala 


Val 


405 








Lys 


Tyr 


.Val 


Arg 


Phe 


Glu 


His 


Val 








A A f\ 

440 


Ser 


Gin 


Val 


Met 






A c tz 




Ser 


Leu 


Ser 


Leu 




470 






Asp 


Glu 


Val 


Asp 


485 








Val 


Leu 


Ala 


Asn 


Ala 


Leu 


Met 


Pro 








520 


Pro 


Val 


Asn 


Gly 






535 




Phe 


Gly 


Ala 


Asp 




550 






Val 


Asp 


Ala 


Arg 


565 








Ser 


Gly 


Leu 


Thr 


Asp 


Ala 


Glu 


Phe 






600 


Leu 


Val 


Phe 


Phe 



fil S 


Pi 7-1 

bin 


V?1U 


Arg 








14U 


vai 


Ala 


Lys 


GlU 






ICC 




Gly 


Met 


Leu 


Leu 




170 






val 


Cys 


Cys 


Pro 










Ala 


GlU 


GlU 


Asp 


Tyr 


Ala 


Asp 


Gly 








ZZ U 


G1U 


vai 


Ala 


GlU 






o o. c 




Glu 


Asp 


Gly 


Asp 




250 






Ala 


Thr 


GlU 


Arg 


o c c 
ZDD 








Glu 


Ser 


vai 


GlU 


Pro 


Asp 


Ala 


Val 








300 


His 


Ala 


His 


pne 






315 




Glu 


Arg 


Met 


Ser 




330 






Ala 


Lys 


Asn 


Leu 


"i A C 

345 








Gin 


Glu 


Lys 


Val 


Gin 


Leu 


Val 


Glu 








380 


Arg 


Arg 


Arg 


Leu 






395 




Pro 


Pro 


Arg 


Pro 




410 






Ala 


Glu 


Gin 


Lys 


/IOC 








Arg 


Met 


vai 


Asp 


Tnr 


HIS 


Leu 


Arg 








A C A 

460 


Leu 


Tyr 


Asn 


vai 






A *7 C 
4 / D 




Glu 


Leu 


Leu 


Gin 




490 






Met 


He 




ul U. 


505 








Ser 


Leu 


Thr 


Glu 


Glu 


Phe 


Ser 


Leu 








540 


Ser 


Val 


Pro 


Ala 






555 




Pro 


Ala 


Ala 


Asp 




570 






Asn 


He 


Lys 


Thr 


585 








Arg 


His 


Asp 


Ser 


Ala 


Glu 


Asp 


Val 



±Z 0 








Met 


Asp Val 


Cys 


Thr 


Cys 


Ser 


GlU 








lbu 


Pro 


Cys Gly 


Tl ~ 

lie 






175 




Leu 


Ala 


Glu 


GlU 




190 






Asp 


Ser Asp 


Val 


one. 








Ser 


Glu Asp 


Lys 


Val 


Glu 


Glu 


GlU 








*5 A fi 


GlU 


Val 


Glu 


GlU 






255 




Thr 


Thr 


Ser 


lie 




270 






Glu 


Val 


Val 


Arg 


285 








Asp 


Lys 


Tyr 


Leu 


Gin 


Lys 


Ala 


Lys 








O O C\ 


Gin 


Val 


Met 


Arg 






335 




Pro 


Lys 


Ala 


Asp 




350 






Glu 


Ser 


Leu 


Glu 


365 








Thr 


His 


Met 


Ala 


Ala 


Leu 


Glu 


Asn 








400 


Arg 


His 


Val 


Phe 






415 




Asp 


Arg 


Gin 


His 




430 






Pro 


Lys 


Lys 


Ala 


445 








Val 


He 


Tyr 


Glu 


Pro 


Ala 


Val 


Ala 








480 


Lys 


Glu 


Gin 


Asn 




495 




Pro 


Arg 


He 


Ser 




510 






Thr 


Lys 


Thr 


Thr 


525 








Asp 


Asp 


Leu 


Gin 


Asn 


Thr 


Glu 


Asn 








560 


Arg 


Gly Leu 


Thr 






575 




Glu 


Glu 


He 


Ser 




590 






Gly 


Tyr 


Glu 


Val 


605 








Gly 


Ser 


Asn 


Lys 
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OXU OlD 






0 A U 










Gly 


Ala lie lie Gly Leu Met 


Val Gly Gly Val 


vai 


lie 


Aia 


inr 


val 


CO E 


con 




635 












lie 


Val He Tnr Leu Val Met 


Leu Lys Lys 


Lys 


Lys 


Leu 


biy 


Tnr 


blU 




c ^ c 

645 


650 










ODD 




Leu 


Gly Ser Thr Ser Pro Val 


Trp Trp Asn 


Ser 


TV 1 -n 

Ala 


Asp 


lie 


Leu 


pi,, 
LxlU 




CCA 
ODU 


665 








b / U 






Pro 


Val Asp Pro Arg Leu Glu 


Pro Trp Lys 


His 


Pro 




oer 


bin 


Pro 




675 


680 






685 








Lys 


Thr Ala Cys Thr Asn Cys 


Tyr Cys Lys 


Lys 


Cys 


Cys 


Phe 


His 


Cys 


690 695 






700 










Gin 


Val Cys Phe Met Thr Lys 


Ala Leu Gly He 


Ser 


Tyr 


Gly 


Arg 


Lys 


705 


710 




715 










720 


Lys 


Arg Arg Gin Arg Arg Arg 


Ala His Gin 


Asn 


Ser 


Gin 


Thr 


His 


Gin 




725 


730 










735 




Ala 


Ser Leu Ser Lys Gin Arg 


He Ser Ser 


Thr 


Val 


Ala 


Ala 


Ala 


Asp 




740 


745 








750 






Ala 


Ala Val Thr Pro Glu Glu 


Arg His Leu 


Ser 


Lys 


Met 


Gin 


Gin 


Asn 




755 


760 






765 








Gly 


Tyr Glu Asn Pro Thr Tyr 


Lys Phe Phe 


Glu 


Gin 


Met 


Gin 


Asn 




770 775 






780 











<210> 5 
<211> 2823 
<212> DNA 

<213> fusion protein 



<400> 5 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 

ggttcaaaca 

atcgtcatca 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 



human 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caagtattgg 
atggtgggcg 
aaaaagcttg 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaatctaga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgcagaattc 
agaagatgtg 
agcgacagtg 
cggatccact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 
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agtccagtgt ggtggaattc tgcagatatc 
gatatttgcc gacttaaaaa gctcaagtgc 
ctgaagaaca actgggagtg tcgctactct 
gcacatctga cagaagtgga atcaaggcta 
tttcctcgag aagaccttga catgattttg 
ttgttaacag gattatttgt acaagataat 
gcttcagtgg agactgatat gcctctaaca 
tcatcggaag agagtagtaa caaaggtcaa 
gatctggccc ccccgaccga tgtcagcctg 
gtggcgatgg cgcatgccga cgcgctagac 
gattccccgg gtccgggatt taccccccac 
gccgacttcg agtttgagca gatgtttacc 
gatatccagc acagtggcgg ccgcgacgcc 
aagatgcagc agaacggcta cgaaaatcca 
tag 

<210> 6 
<211> 941 
<212> PRT 

<213> fusion protein - human 



<400> 6 


















Met 


Leu 


Pro 


Gly Leu Ala Leu 


Leu Leu Leu 


Ala Ala 


Trp 


Thr 


Ala 


Arg 


1 






5 


10 






15 




Ala 


Leu 


Glu 


Val Pro Thr Asp 


Gly Asn Ala 


Gly Leu 


Leu 


Ala 


Glu 


Pro 








20 


25 






30 






Gin 


He 


Ala 


Met Phe Cys Gly 


Arg Leu Asn 


Met His 


Met 


Asn 


Val 


Gin 






35 




40 




45 








Asn 


Gly 


Lys 


Trp Asp Ser Asp 


Pro Ser Gly 


Thr Lys 


Thr 


Cys 


He 


Asp 




50 




55 




60 










Thr 


Lys 


Glu 


Gly He Leu Gin 


Tyr Cys Gin 


Glu Val 


Tyr 


Pro 


Glu 


Leu 


65 






70 




75 








80 


Gin 


He 


Thr 


Asn Val Val Glu 


Ala Asn Gin 


Pro Val 


Thr 


He 


Gin 


Asn 








85 


90 








95 




Tip 


Cys 


Lys 


Arg Gly Arg Lys 


Gin Cys Lys 


Thr His 


Pro 


His 


Phe 


Val 








100 


105 






110 






He 


Pro 


Tyr 


Arg Cys Leu Val 


Gly Glu Phe 


He Ser 


Asp 


Ala 


Leu 


Leu 






115 




120 




125 








Val 


Pro 


Asp 


Lys Cys Lys Phe 


Leu His Gin 


Glu Arg 


Met 


Asp 


Val 


Cys 




130 




135 




140 










Glu 


Thr 


His 


Leu His Trp His 


Thr Val Ala 


Lys Glu 


Thr 


Cys 


Ser 


Glu 


145 






150 




155 






160 


Lys 


Ser 


Thr 


Asn Leu His Asp 


Tyr Gly Met 


Leu Leu 


Pro 


Cys 


Gly 


He 








165 


170 








175 




Asp 


Lys 


Phe 


Arg Gly Val Glu 


Phe Val Cys 


Cys Pro 


Leu 


Ala 


Glu 


Glu 








180 


185 






190 






Ser 


Asp 


Asn 


Val Asp Ser Ala 


Asp Ala Glu 


Glu Asp 


Asp 


Ser 


Asp 


Val 






195 




200 




205 








Trp 


Trp 


Gly 


Gly Ala Asp Thr 


Asp Tyr Ala 


Asp Gly 


Ser 


Glu 


Asp 


Lys 




210 




215 




220 










Val 


Val 


Glu 


Val Ala Glu Glu 


Glu Glu Val 


Ala Glu 


Val 


Glu 


Glu 


Glu 


225 






230 




235 








240 


Glu 


Ala 


Asp 


Asp Asp Glu Asp 


Asp Glu Asp 


Gly Asp 


Glu 


Val 


Glu 


Glu 








245 


250 






255 




Glu 


Ala 


Glu 


Glu Pro Tyr Glu 


Glu Ala Thr 


Glu Arg 


Thr 


Thr 


Ser 


He 








260 


265 






270 






Ala 


Thr 


Thr 


Thr Thr Thr Thr 


Thr Glu Ser 


Val Glu 


Glu 


Val 


Val 


Arg 






275 




280 




285 






Val 


Pro 


Thr 


Thr Ala Ala Ser 


Thr Pro Asp 


Ala Val 


Asp 


Lys 


Tyr 


Leu 




290 




295 


300 




Glu 


Thr 


Pro 


Gly Asp Glu Asn 


Glu His Ala 


His Phe 


Gin 


Lys 


Ala 


Lys 


305 






310 




315 






320 



aagctactgt cttctatcga acaagcatgc 2040 
tccaaagaaa aaccgaagtg cgccaagtgt 2100 
cccaaaacca aaaggtctcc gctgactagg 2160 
gaaagactgg aacagctatt tctactgatt 2220 
aaaatggatt ctttacagga tataaaagca 2280 
gtgaataaag atgccgtcac agatagattg 2340 
ttgagacagc atagaataag tgcgacatca 2400 
agacagttga ctgtatcggg aattcccggg 2460 
ggggacgagc tccacttaga cggcgaggac 252 0 
gatttcgatc tggacatgtt gggggacggg 2580 
gactccgccc cctacggcgc tctggatatg 2640 
gatgcccttg gaattgacga gtacggtggg 2700 
gctgtcaccc cagaggagcg ccacctgtcc 2760 
acctacaagt tctttgagca gatgcagaac 2820 

2823 
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blU 


Arg 


Leu 


Glu Ala Lys His Arg Glu 








J^b 


GlU 


Trp 


GlU 


Glu Ala Glu Arg Gin Ala 








340 34b 


Lys 


Lys 


Ala 


Val lie Gin His Pne Gin 






T C C 

355 


*5 r r\ 

360 


Gin 


Glu 


Ala 


Ala Asn Glu Arg Gin Gin 




•a r 7 n 

370 




375 


Arg 


Val 


Glu 


Ala Met Leu Asn Asp Arg 


*i Q C 

385 






390 


Tyr 


lie 


Thr 


Ala Leu Gin Ala Val Pro 






405 


Asn 


Met 


Leu 


Lys Lys Tyr Val Arg Ala 








420 425 


Thr 


Leu 


Lys 


His Pne Glu His Val Arg 






A O C 

435 


a a n 
440 


Ala. 


Gin 


lie 


Arg Ser Gin Val Met Thr 




450 




455 


Arg 


Met 


Asn 


Gin Ser Leu Ser Leu Leu 


465 






470 


Glu 


Glu 


lie 


Gin Asp Glu Val Asp Glu 








485 


Tyr 


Ser 


Asp 


Asp Val Leu Ala Asn Met 








500 505 


Tyr 


Gly 


Asn 


Asp Ala Leu Met Pro Ser 






515 


520 


Val 


Glu 


Leu 


Leu Pro Val Asn Gly Glu 




530 




535 


Pro 


Trp 


His 


Ser Phe Gly Ala Asp Ser 


545 






550 


Glu 


Val 


Glu 


Pro Val Asp Ala Arg Pro 








565 


Thr 


Arg 


Pro 


Gly Ser Gly Leu Thr Asn 








580 585 


Glu 


Val 


Asn 


Leu Asp Ala Glu Phe Arg 






595 


600 


His 


His 


Gin 


Val Leu Val Phe Phe Ala 




610 




615 


Gly 


Ala 


lie 


lie Gly Leu Met Val Gly 


625 






a a 

630 


lie 


Val 


lie 


Thr Leu Val Met Leu Lys 








645 


Leu 


Gly 


Ser 


Thr Ser Pro Val Trp Trp 








660 665 


Leu 


Ser 


Ser 


lie Glu Gin Ala Cys Asp 






675 


680 


Lys 


Cys 


Ser 


Lys Glu Lys Pro Lys Cys 




o ^ u 




027 J 


Trp 


Glu 


Cys 


Arg Tyr Ser Pro Lys Thr 


705 






710 


Ala 


His 


Leu 


Thr Glu Val Glu Ser Arg 








725 


Phe 


Leu 


Leu 


lie Phe Pro Arg Glu Asp 








740 745 


Asp 


Ser 


Leu 


Gin Asp lie Lys Ala Leu 






755 


760 


Asp 


Asn 


Val 


Asn Lys Asp Ala Val Thr 




770 




775 


Thr 


Asp 


Met 


Pro Leu Thr Leu Arg Gin 


785 






790 


Ser 


Ser 


Glu 


Glu Ser Ser Asn Lys Gly 



805 
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Arg Mec 


Pay. 

ber 


Gin 


vai 


Mct 


Arg 










j J 3 




Lys Asn 


Leu 


Pro 


Lys 


Til a 

Aia 


Asp 








OCA 






Glu Lys 


val 


GlU 


Ser 


Leu 


(jlU 






ice 

Job 








Leu Val 


GlU 


Thr 


HIS 


■pur 4- 

Mec 


Aia 




380 










Arg Arg 


Leu 


Ala 


Leu 


/Ml 

GlU 


Asn 












a on 
4t uu 


Pro Arg 


Pro 


Arg 


His 


vai 


pne 


a i a 
410 






/I 1 c 

41b 




(jlU din 


Lys 


Asp 


Arg 


bin 


HIS 














Met Val 


Asp 


Pro 


Lys 


Lys 


Ala 






445 








His Leu 


Arg 


Val 


lie 


Tyr 


Glu 




A Cf\ 

4bU 










Tyr Asn 


Val 


Pro 


Ala 


Val 


Ala 


475 










A O A 

480 


Leu Leu 


Gin 


Lys 


Glu 


Gin 


Asn 


a o a 
49 U 








/IOC 

4y b 




lie Ser 


Glu 


Pro 


Arg 


lie 


Ser 








CIA 

blU 






Leu Thr 


Glu 


Thr 


Lys 


Thr 


Thr 






525 








Phe Ser 


Leu 


Asp 


Asp 


Leu 


Gin 




c a a 
b4U 










Val Pro 


Ala 


Asn 


Thr 


Glu 


Asn 


555 










CCA 

bbU 


Ala Ala 


Asp 


Arg 


Gly 


Leu 


Thr 


C *7 A 

5 /0 








C 1 c 

b /b 




lie Lys 


Thr 


Glu 


Glu 


He 


Ser 






con 

by u 






His Asp 


Ser 


Gly 


Tyr 


Glu 


val 






foe 

bub 








Glu Asp 


Val 


Gly 


Ser 


Asn 


Lys 




620 










Gly Val 


Val 


lie 


si- 
Ala 


Thr 


vai 


r -j c 

635 










C A A 


Lys Lys 


Lys 


Leu 


Gly 


Thr 


bill 


650 








c c c 

obb 




Asn Ser 


Ala 


Asp 


lie 


Lys 


Leu 








fin 

o / U 






lie Cys 


Arg 


Leu 


Lys 


Lys 


Leu 






685 








Ala Lys 


Cys 


Leu 


Lys 


Asn 


Asn 




700 










Lys Arg 


Ser 


Pro 


Leu 


Thr 


Arg 


715 










720 


Leu Glu 


Arg 


Leu 


Glu 


Gin 


Leu 


730 








735 




Leu Asp 


Met 


He 


Leu 


Lys 


Met 








750 






Leu Thr 


Gly 


Leu 


Phe 


Val 


Gin 






765 








Asp Arg 


Leu 


Ala 


Ser 


Val 


Glu 




780 










His Arg 


lie 


Ser 


Ala 


Thr 


Ser 


795 










800 


Gin Arg 


Gin 


Leu 


Thr 


Val 


Ser 


810 








815 
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lie 


Prn fZ~\ \r 7i en T.on Jil a' Prn Prn TTit* A qfi 
Jtr I. u u±y ao^J JJ6U rlJ-O. .t-L ^ nu nu. rvojj 


Val 


Ser 


Leu 


Gly 


Asp 






ftps 






830 






Glu 


Leu 


Hi«5 Tjpu A«;n fJlv Glu Asn Val Ala Met 


Ala 


His 


Ala 


Asd 


Ala 






835 ~ 840 




845 








Leu 


Asp 


Asp Phe Asp Leu Asp Met Leu Gly Asp 


Gly 


Asp 


Ser 


Pro 


Gly 




850 


855 


860 










Pro 


Gly 


Phe Thr Pro His Asp Ser Ala Pro Tyr 


Gly 


Ala 


Leu 


Asp 


Met 


865 




870 875 










880 


Ala 


Asp 


Phe Glu Phe Glu Gin Met Phe Thr Asp 


Ala 


Leu 


Gly 


He 


Asp 




885 890 








895 




Glu 


Tyr 


Gly Gly Asp lie Gin His Ser Gly Ala 


Ala 


Ala 


Asp 


Ala 


Ala 




900 905 






910 






Val 


Thr 


Pro Glu Glu Arg His Leu Ser Lys Met 


Gin 


Gin 


Asn 


Gly 


Tyr 






915 920 




925 








Glu 


Asn 


Pro Thr Tyr Lys Phe Phe Glu Gin Met 


Gin 


Asn 










930 


935 


940 











<210> 7 

<211> 2823 

<212> DNA 

<213> fusion protein - human 



<400> 7 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgeattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

geggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

geagaaegtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 

ggttcaaaca 

ategtcatea 

agtccagtgt 

gatatttgee 

ctgaagaaca 

gcacatctga 

tttcctcgag 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgeect 
gegaaactea 
acttgeatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagecacaga 
aagaggtggt 
tcgagacacc 
aggecaagea 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gegcagaaca 
atcccaagaa 
agegcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
ggtggaattc 
gacttaaaaa 
actgggagtg 
cagaagtgga 
aagaccttga 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
etaeggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
ettgectaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gaeggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
tgeagatate 
gctcaagtgc 
tcgctactct 
atcaaggcta 
catgattttg 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgee 
ccagtgacca 
tttgtgattc 
gaeaagtgea 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
ageattgeca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caagtattgg 
atggtgggcg 
aaaaagcttg 
aagctactgt 
tccaaagaaa 
cccaaaacca 
gaaagactgg 
aaaatggatt 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaagatgga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 
cttctatcga 
aaccgaagtg 
aaaggtctcc 
aacagctatt 
ctttacagga 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgeaagegg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgetgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgetaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgeagaatte 
agaagatgtg 
agegacagtg 
cggatccact 
acaagcatgc 
cgccaagtgt 
gctgactagg 
tctactgatt 
tataaaagca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
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ttgttaacag gattatttgt acaagataat 

gcttcagtgg agactgatat gcctctaaca 

tcatcggaag agagtagtaa caaaggtcaa 

gatctggccc ccccgaccga tgtcagcctg 

gtggcgatgg cgcatgccga cgcgctagac 

gattccccgg gtccgggatt taccccccac 

gccgacttcg agtttgagca gatgtttacc 

gatatccagc acagtggcgg ccgcgacgcc 

aagatgcagc agaacggcta cgaaaatcca 
tag 



gtgaataaag atgccgtcac agatagattg 2340 

ttgagacagc atagaataag tgcgacatca 2400 

agacagttga ctgtatcggg aattcccggg 2460 

ggggacgagc tccacttaga cggcgaggac 252 0 

gatttcgatc tggacatgtt gggggacggg 2580 

gactccgccc cctacggcgc tctggatatg 2640 

gatgcccttg gaattgacga gtacggtggg 2700 

gctgtcaccc cagaggagcg ccacctgtcc 2760 

acctacaagt tctttgagca gatgcagaac 2820 

2823 



<210> 8 
<211> 941 
<212> PRT 

<213> fusion protein - human 



<400> 8 



Met 


Leu 


Pro 


Gly 


Leu 


Ala 


Leu 


Leu 


1 








5 








Ala 


Leu 


Glu 


Val 


Pro 


Thr 


Asp 


Gly 








20 










Gin 


He 


Ala 


Met 


Phe 


Cys 


Gly 


Arg 






35 










40 


Asn 


Gly 


Lys 


Trp 


Asp 


Ser 


Asp 


Pro 




50 










55 




Thr 


Lys 


Glu 


Gly 


He 


Leu 


Gin 


Tyr 


65 










70 






Gin 


He 


Thr 


Asn 


Val 


Val 


Glu 


Ala 










85 








Trp 


Cys 


Lys 


Arg 


Gly 


Arg 


Lys 


Gin 








100 










lie 


Pro 


Tyr 


Arg 


Cys 


Leu 


Val 


Gly 






115 










120 


Val 


Pro 


Asp 


Lys 


Cys 


Lys 


Phe 


Leu 




130 










135 




Glu 


Thr 


His 


Leu 


His 


Trp 


His 


Thr 


145 










150 






Lys 


Ser 


Thr 


Asn 


Leu 


His 


Asp 


Tyr 










165 








Asp 


Lys 


Phe 


Arg 


Gly 


Val 


Glu 


Phe 








180 










Ser 


Asp 


Asn 


Val 


Asp 


Ser 


Ala 


Asp 






195 










200 


Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thr 


Asp 




210 










215 




Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu 


Glu 


225 










230 






Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp 


Asp 










245 








Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu 


Glu 








260 










Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 






275 










280 


Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser 


Thr 




290 










295 




Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn 


Glu 


305 






310 






Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 










325 








Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg 


Gin 








340 










Lys 


Lys 


Ala 


Val 


He 


Gin 


His 


Phe 



Leu 


Leu 


Ala 


Ala 


Trp 


Thr 


Ala 


Arg 




10 










15 




Asn 


Ala 


Gly 


Leu 


Leu 


Ala 


Glu 


Pro 


25 










30 






Leu 


Asn 


Met 


His 


Met 


Asn 


Val 


Gin 










45 








Ser 


Gly 


Thr 


Lys 


Thr 


Cys 


He 


Asp 








60 










Cys 


Gin 


Glu 


Val 


Tyr 


Pro 


Glu 


Leu 






75 










80 


Asn 


Gin 


Pro 


Val 


Thr 


He 


Gin 


Asn 




90 










95 




Cys 


Lys 


Thr 


His 


Pro 


His 


Phe 


Val 


105 










110 






Glu 


Phe 


He 


Ser 


Asp 


Ala 


Leu 


Leu 










125 








His 


Gin 


Glu 


Arg 


Met 


Asp 


Val 


Cys 








140 










Val 


Ala 


Lys 


Glu 


Thr 


Cys 


Ser 


Glu 






155 










160 


Gly 


Met 


Leu 


Leu 


Pro 


Cys 


Gly 


He 




170 










175 




Val 


Cys 


Cys 


Pro 


Leu 


Ala 


Glu 


Glu 


185 










190 






Ala 


Glu 


Glu 


Asp 


Asp 


Ser 


Asp 


Val 










205 








Tyr 


Ala 


Asp 


Gly 


Ser 


Glu 


Asp 


Lys 








220 










Glu 


Val 


Ala 


Glu 


Val 


Glu 


Glu 


Glu 






235 










240 


Glu 


Asp 


Gly 


Asp 


Glu 


Val 


Glu 


Glu 




250 










255 




Ala 


Thr 


Glu 


Arg 


Thr 


Thr 


Ser 


He 


265 










270 






Glu 


Ser 


Val 


Glu 


Glu 


Val 


Val 


Arg 










285 








Pro 


Asp 


Ala 


Val 


Asp 


Lys 


Tyr 


Leu 








300 










His 


Ala 


His 


Phe 


Gin 


Lys 


Ala 


Lys 






315 










320 


Glu 


Arg 


Met 


Ser 


Gin 


Val 


Met 


Arg 




330 










335 


Ala 


Lys 


Asn 


Leu 


Pro 


Lys 


Ala 


Asp 


345 










350 




Gin 


Glu 


Lys 


Val 


Glu 


Ser 


Leu 


Glu 
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355 360 365 



Gill 


blU 


Aia 


Aia 


Asn 


bill 


Arg 


bin 


bin 


Leu 


vai 


GlU 


inr 


nlS 


rieu 


A±a 




j / U 










3 /O 




















Arg 


val 


GlU 


Aia 


Mec 


Leu 


Asn 


Asp 


Arg 


Arg 


Arg 


T /->t -i 

Lieu 


Aia 


Leu 


GlU 


Asn 


Ton 

385 










jy u 










one 

jyb 










4UU 


Tyr 


lie 


inr 


Ala 


Leu 


Gin 


Aia 


vai 


Pro 


Pro 


Arg 


pro 


Arg 


HIS 


vai 


73 Via 

rne 








4Ub 










41U 








/lie 
41D 




Asn 


Met 


Leu 


Lys 


Lys 


Tyr 


val 


Arg 


Ala 


GlU 


Gin 


Lys 


Asp 


Arg 


Gin 


TT 4 <-» 

HIS 








/ion 

420 










/IOC 

4^b 










a *a n 
4 J U 






Thr 


Leu 


Lys 


His 


Pne 


Glu 


HIS 


vai 


Arg 


Met 


vai 


ASp 


Pro 


Lys 


Lys 


Aia 






435 










440 










A A C 

44b 








Ala 


Gin 


lie 


Arg 


Ser 


Gin 


Val 


Met 


Thr 


His 


Leu 


Arg 


vai 


lie 


Tyr 


GlU 




450 










455 










/en 
45U 










Arg 


Met 


Asn 


Gin 


Ser 


Leu 


Ser 


Leu 


Leu 


Tyr 


Asn 


vai 


Pro 


TV 1 -» 

Aia 


vai 


Ala 


Arc 

465 










470 








475 










a q n 
4oU 


Glu 


Glu 


lie 


Gin 


Asp 


Glu 


Val 


Asp 


GlU 


Leu 


Leu 


Gin 


Lys 


GlU 


Gin 


Asn 










a a c 

485 










a on 

490 














Tyr 


Ser 


Asp 


Asp 


Val 


Leu 


Ala 


Asn 


Met 


lie 


Ser 


GlU 


Pro 


Arg 


lie 


Ser 








500 










c n c 

505 










blU 






Tyr 


Gly 


Asn 


Asp 


Ala 


Leu 


Met 


Pro 


Ser 


Leu 


Thr 


Glu 


Tnr 


Lys 


Thr 


Thr 




515 








C 1 A 

520 










c o c 
bzb 








Val 


Glu 


Leu 


Leu 


Pro 


Val 


Asn 


Gly 


Glu 


Pne 


Ser 


Leu 


Asp 


Asp 


Leu 


Gin 




530 










535 










540 










Pro 


Trp 


His 


Ser 


Pne 


Gly 


Ala 


Asp 


Ser 


Val 


Pro 


Aia 


Asn 


Thr 


Glu 


Asn 


545 










550 










555 










c ^ n 

boO 


Glu 


Val 


Glu 


Pro 


Val 


Asp 


Ala 


Arg 


Pro 


Ala 


7\ -I _ 

Ala 


Asp 


Arg 


Gly 


Leu 


Thr 










565 










570 










575 




Thr 


Arg 


Pro 


Gly 


Ser 


Gly 


Leu 


Thr 


Asn 


lie 


Lys 


Thr 


GlU 


Glu 


lie 


Ser 








580 










585 










con 

590 






Glu 


Val 


Lys 


Met 


Asp 


Ala 


Glu 


Pne 


Arg 


His 


Asp 


Ser 


Gly 


Tyr 


GlU 


Val 






595 










600 










605 








His 


His 


Gin 


Val 


Leu 


Val 


Phe 


Phe 


Ala 


Glu 


Asp 


Val 


Gly 


Ser 


Asn 


Lys 




610 










615 










620 










Gly 


Ala 


lie 


lie 


Gly 


Leu 


Met 


Val 


Gly 


Gly 


Val 


Val 


lie 


Ala 


Thr 


Vai 


625 










630 










635 










640 


lie 


Val 


lie 


Thr 


Leu 


Val 


Met 


Leu 


Lys 


Lys 


Lys 


Lys 


Leu 


Gly 


Thr 


Glu 










645 










c c rk 










bob 




Leu 


Gly 


Ser 


Thr 


Ser 


Pro 


Val 


Trp 


Trp 


Asn 


Ser 


Ala 


Asp 


lie 


Lys 


Leu 








660 










c c c 

665 










D / 0 






Leu 


Ser 


Ser 


lie 


Glu 


Gin 


Ala 


Cys 


Asp 


lie 


Cys 


Arg 


Leu 


Lys 


Lys 


Leu 






f n c 

675 










680 










c o c 

685 








Lys 


Cys 


Ser 


Lys 


Glu 


Lys 


Pro 


Lys 


Cys 


Ala 


Lys 


Cys 


Leu 


Lys 


Asn 


Asn 




690 










695 










700 










Trp 


Glu 


Cys 


Arg 


Tyr 


Ser 


Pro 


Lys 


Thr 


Lys 


Arg 


Ser 


Pro 


Leu 


Thr 


Arg 


705 










710 










715 










720 


Ala 


His 


Leu 


Thr 


Glu 


Val 


Glu 


Ser 


Arg 


Leu 


GlU 


Arg 


Leu 


Glu 


Gin 


Leu 










725 










/ J u 










/ ob 




Phe 


Leu 


Leu 


lie 


Phe 


Pro 


ni y 


Glu 


Asp 


Leu 


Asp 


Met 


He 


Leu 


Lys 


Met 








740 










745 










750 






Asp 


Ser 


Leu 


Gin 


Asp 


lie 


Lys 


Ala 


Leu 


Leu 


Thr 


Gly 


Leu 


Phe 


Val 


Gin 






755 










760 










765 








Asp 


Asn 


Val 


Asn 


Lys 


Asp 


Ala 


Val 


Thr 


Asp 


Arg 


Leu 


Ala 


Ser 


Val 


Glu 




770 










775 










780 










Thr 


Asp 


Met 


Pro 


Leu 


Thr 


Leu 


Arg 


Gin 


His 


Arg 


He 


Ser 


Ala 


Thr 


Ser 


785 










790 










795 










onn 

u u u 


Ser 


Ser 


Glu 


Glu 


Ser 


Ser 


Asn 


Lys 


Gly 


Gin 


Arg 


Gin 


Leu 


Thr 


Val 


Ser 










805 








810 










815 




Gly 


lie 


Pro 


Gly 


Asp 


Leu 


Ala 


Pro 


Pro 


Thr 


Asp 


Val 


Ser 


Leu 


Gly 


Asp 








820 










825 










830 






Glu 


Leu 


His 


Leu 


Asp 


Gly 


Glu 


Asp 


Val 


Ala 


Met 


Ala 


His 


Ala 


Asp 


Ala 






835 










840 










845 








Leu 


Asp 


Asp 


Phe 


Asp 


Leu 


Asp 


Met 


Leu 


Gly 


Asp 


Gly 


Asp 


Ser 


Pro 


Gly 
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850 




855 




Pro 


Gly 


Phe Thr Pro His 


Asp 


Ser 


865 




870 




Ala 


Asp 


Phe Glu Phe Glu 


Gin 


Met 






885 






Glu 


Tyr 


Gly Gly Asp lie 


Gin 


His 






900 






Val 


Thr 


Pro Glu Glu Arg 


His 


Leu 






915 




920 


Glu 


Asn 


Pro Thr Tyr Lys 


Phe 


Phe 




930 




935 





860 

Ala Pro Tyr Gly Ala Leu Asp Met 
875 880 
Phe Thr Asp Ala Leu Gly lie Asp 

890 895 
Ser Gly Ala Ala Ala Asp Ala Ala 
905 910 
Ser Lys Met Gin Gin Asn Gly Tyr 
925 

Glu Gin Met Gin Asn 
940 



<210> 9 
<211> 2352 
<212> DNA 

<213> fusion protein - human 
<400> 9 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 
cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 
acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 
cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 
ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 3 60 
gagtttataa gtgatgccct tctcgttcct gacaagtgca aattcttaca ccaggagagg 420 
atggatgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 
aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagttccga 540 
ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 
gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 72 0 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 
ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 840 
gagtctgtgg aagaggtggt tcgagttcct acaacagcag ccagtacccc tgatgccgtt 900 
gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca gaaagccaaa 960 
gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 
gcagaacgtc aagcaaagaa cttgcctaaa gctgataaga aggcagttat ccagcatttc 1080 
caggagaaag tggaatcttt ggaacaggaa gcagccaacg agagacagca gctggtggag 1140 
acacacatgg ccagagtgga agccatgctc aatgaccgcc gccgcctggc cctggagaac 1200 
tacatcaccg ctctgcaggc tgttcctcct cggcctcgtc acgtgttcaa tatgctaaag 1260 
aagtatgtcc gcgcagaaca gaaggacaga cagcacaccc taaagcattt cgagcatgtg 1320 
cgcatggtgg atcccaagaa agccgctcag atccggtccc aggttatgac acacctccgt 1380 
gtgatttatg agcgcatgaa tcagtctctc tccctgctct acaacgtgcc tgcagtggcc 1440 
gaggagattc aggatgaagt tgatgagctg cttcagaaag agcaaaacta ttcagatgac 1500 
gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 
tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 
gacgatctcc agccgtggca ttcttttggg gctgactctg tgccagccaa cacagaaaac 1680 
gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 
tctgggttga caaatatcaa gacggaggag atctctgaag tgaatctaga tgcagaattc 1800 
cgacatgact caggatatga agttcatcat caaaaattgg tgttctttgc agaagatgtg 1860 
ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
atcgtcatca ccttggtgat gctgaagaag aaaaagcttg gtaccgagct cggatccact 1980 
agtccagtgt ggtggaattc tgcagatatc ctggagccag tagatcctag actagagccc 2040 
tggaagcatc caggaagtca gcctaaaact gcttgtacca attgctattg taaaaagtgt 2100 
tgctttcatt gccaagtttg tttcatgaca aaagccttag gcatctccta tggcaggaag 2160 
aagcggagac agcgacgaag agctcatcag aacagtcaga ctcatcaagc ttctctatca 2220 
aagcagagga tatccagcac agtggcggcc gcagacgccg ctgtcacccc agaggagcgc 2280 
cacctgtcca agatgcagca gaacggctac gaaaatccaa cctacaagtt ctttgagcag 2340 
atgcagaact ag 2352 

<210> 10 
<211> 783 
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<212> PRT 

<213> fusion protein - human 



<400> 10 



Met 


Leu 


Pro 


Gly 


Leu 


Ala 


Leu 


Leu 


Leu 


1 






5 










Ala 


Leu 


Glu 


Val 


Pro 


Thr 


Asp 


Gly 


Asn 








20 










25 


Gin 


lie 


Ala 


Met 


Phe 


Cys 


Gly 
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Leu 
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Gly 
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Glu 
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Leu 


Gin 


Tyr 


Cys 
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Val 
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Glu 


Ala 


Asn 
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Trp 


Cys 


Lys 


Arg 


Gly 


Arg 


Lys 


Gin 


Cys 








100 










105 


lie 


Pro 


Tyr 


Arg 


Cys 


Leu 


Val 


Gly 


Glu 






115 










120 




Val 


Pro 


Asp 


Lys 


Cys 


Lys 


Phe 


Leu 


His 




130 










135 






Glu 


Thr 


His 


Leu 


His 


Trp 


His 


Thr 


Val 


145 










150 








Lys 


Ser 


Thr 


Asn 


Leu 


His 


Asp 


Tyr 


Gly 










165 










Asp 


Lys 


Phe 


Arg 


Gly 


Val 


Glu 


Phe 


Val 








180 










185 


Ser 


Asp 


Asn 


Val 


Asp 


Ser 


Ala 


Asp 


Ala 






195 










200 




Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thr 


Asp 


Tyr 




210 










215 






Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu 


Glu 


Glu 


225 










230 








Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp 


Asp 


Glu 










245 










Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu 


Glu 


Ala 








260 










265 


Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Glu 






275 










280 




Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser 


Thr 


Pro 




290 










295 






Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn 


Glu 


His 


305 










310 








Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 


Glu 










325 










Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg 


Gin 


Ala 








340 










345 


Lys 


Lys 


Ala 


Val 


lie 


Gin 


His 


Phe 


Gin 






355 










360 




Gin 


Glu 


Ala 


Ala 


Asn 


Glu 


Arg 


Gin 


Gin 




370 










375 






Arg 


Val 


Glu 


Ala 


Met 


Leu 


Asn 


Asp 


Arg 


385 










390 








Tyr 


lie 


Thr 


Ala 


Leu 


Gin 


Ala 


Val 


Pro 










405 










Asn 


Met 


Leu 


Lys 


Lys 


Tyr 


Val 


Arg 


Ala 








420 










425 


Thr 


Leu 


Lys 


His 


Phe 


Glu 


His 


Val 


Arg 






435 










440 




Ala 


Gin 


lie 


Arg 


Ser 


Gin 


Val 


Met 


Thr 




450 








455 
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Leu Ala Ala 


Trp 


Thr 


Ala 


Arg 


10 






15 


Ala Gly Leu 


Leu 


Ala 


Glu 


Pro 






30 






Asn Met His 


Met 


Asn 


Val 


Gin 




45 








Gly Thr Lys 


Thr 


Cys 


lie 


Asp 


60 










Gin Glu Val 


Tyr 


Pro 


Glu 


Leu 


75 








80 


Gin Pro Val 


Thr 


lie 


Gin 


Asn 


90 






95 




Lys Thr His 


Pro 


His 


Phe 


Val 






110 






Phe lie Ser 


Asp 


Ala 


Leu 


Leu 




125 








Gin Glu Arg 


Met 


Asp 


Val 


Cys 


140 










Ala Lys Glu 


Thr 


Cys 


Ser 


Glu 


155 








160 


Met Leu Leu 


Pro 


Cys 


Gly 


He 


170 






175 




Cys Cys Pro 


Leu 


Ala 


Glu 


Glu 






190 






Glu Glu Asp 


Asp 


Ser 


Asp 


Val 




205 








Ala Asp Gly 


Ser 


Glu 


Asp 


Lys 


220 










Val Ala Glu 


Val 


Glu 


Glu 


Glu 


235 








240 


Asp Gly Asp 


Glu 


Val 


Glu 


Glu 


250 






255 




Thr Glu Arg 


Thr 


Thr 


Ser 


lie 






270 






Ser Val Glu 


Glu 


Val 


Val 


Arg 




285 






Asp Ala Val 


Asp 


Lys 


Tyr 


Leu 


300 










Ala His Phe 


Gin 


Lys 


Ala 


Lys 


■51 C 










Arg Met Ser 


Gin 


Val 


Met 


Arg 


330 






335 


Lys Asn Leu 


Pro 


Lys 


Ala 


Asp 






350 




Glu Lys Val 


Glu 


Ser 


Leu 


Glu 




365 








Leu Val Glu 


Thr 


His 


Met 


Ala 


380 










Arg Arg Leu 


Ala 


Leu 


Glu 


Asn 


395 








400 


Pro Arg Pro 


Arg 


His 


Val 


Phe 


410 






415 




Glu Gin Lys 


Asp 


Arg 


Gin 


His 




430 






Met Val Asp 


Pro 


Lys 


Lys 


Ala 




445 








His Leu Arg 


Val 


lie 


Tyr 


Glu 



460 
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Arg 




ASH bin 
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ft / u 


blU 


blU 


lie bin 


Asp Glu Val Asp 








ftOD 


Tyr 


Ser 


Asp Asp 


val Leu Ala Asn 






c a a 




Tyr 


Gly 


Asix Asp 


Ala Leu Met Pro 






C 1 c 

515 


con 
520 


Val 


Glu 


Leu Leu 


Pro Val Asn Gly 




c *3 n 

530 




535 


Pro 


Trp 


His Ser 


Pne Gly Ala Asp 


C A C 

545 






CCA 


Glu 


Val 


Glu Pro 


Val Asp Ala Arg 








565 


Thr 


Arg 


Pro Gly 


Ser Gly Leu Thr 










Glu 


Val 


Asn Leu 


Asp Ala Glu Pne 






595 


600 


HIS 


His 


Gin Lys 


Leu Val Pne Pne 




a i a 
olU 




bio 


Gly 


Ala 


He He 


Gly Leu Met Val 


£ a c 

625 






C O A 


lie 


Val 


He Thr 


Leu Val Met Leu 








645 


Leu 


Gly 


Ser Thr 


Ser Pro Val Trp 






bbU 




Pro 


Val 


Asp Pro 


Arg Leu Glu Pro 






675 


680 


Lys 


Thr 


Ala Cys 


Thr Asn Cys Tyr 




690 




695 


Gin 


Val 


Cys Phe 


Met Thr Lys Ala 


705 






710 


Lys 


Arg 


Arg Gin 


Arg Arg Arg Ala 








725 


Ala 


Ser 


Leu Ser 


Lys Gin Arg He 






740 




Ala 


Ala 


Val Thr 


Pro Glu Glu Arg 






755 


760 


Gly 


Tyr 


Glu Asn 


Pro Thr Tyr Lys 




770 




775 



Leu 


Tyr Asn Val Pro 


Ala 


Val 


Ala 




A "7 C 

4/5 






480 


blU 


Leu Leu Gin Lys 


Glu 


Gin 


Asn 




A Q A 




A A C 

495 




Met 


He Ser Glu Pro 


Arg 


He 


Ser 


C A C 




CIA 

510 






Ser 


Leu Thr Glu Thr 


Lys 


Thr 


Thr 




525 






Glu 


Phe Ser Leu Asp 


Asp 


Leu 


Gin 




C A A 

540 






Ser 


Val Pro Ala Asn 


Thr 


Glu 


Asn 




555 






560 


Pro 


Ala Ala Asp Arg 


Gly 


Leu 


Thr 




570 




575 




Asn 


He Lys Thr Glu 


Glu 


He 


Ser 


rnc 

585 




590 






Arg 


His Asp Ser Gly 


Tyr 


Glu 


Val 




C A C • 








Ala 


Glu Asp Val Gly 


Ser 


Asn 


Lys 




620 








Gly 


Gly Val Val He 


Ala 


Thr 


Val 




635 






640 


Lys 


Lys Lys Lys Leu 


Gly 


Thr 


Glu 




650 




655 




Trp 


Asn Ser Ala Asp 


He 


Leu 


Glu 


c c c 

665 




670 






Trp 


Lys His Pro Gly 


Ser 


Gin 


Pro 




685 








Cys 


Lys Lys Cys Cys 


Phe 


His 


Cys 




700 








Leu 


Gly He Ser Tyr 


Gly 


Arg 


Lys 




715 






720 


His 


Gin Asn Ser Gin 


Thr 


His 


Gin 




730 




735 




Ser 


Ser Thr Val Ala 


Ala 


Ala 


Asp 


745 




750 




His 


Leu Ser Lys Met 


Gin 


Gin 


Asn 




765 








Phe 


Phe Glu Gin Met 


Gin 


Asn 





780 



<210> 11 
<211> 2352 
<212> DNA 

<213> fusion protein - human 
<400> 11 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 

cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 

ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 

acctgeattg ataccaagga aggcatcctg cagtattgee aagaagtcta ccctgaactg 240 

cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgeaagegg 300 

ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 360 

gagtttataa gtgatgeect tctcgttcct gaeaagtgea aattcttaca ccaggagagg 420 

atggatgttt gegaaactea tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 

aagagtacca acttgeatga etaeggcatg ttgctgccct gcggaattga caagttccga 540 

ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgetgat 600 

geggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 

agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 

gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 

ccctacgaag aagecacaga gagaaccacc ageattgeca ccaccaccac caccaccaca 840 
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gagtctgtgg 
gacaagtatc 
gagaggcttg 
gcagaacgtc 
caggagaaag 
acacacatgg 
tacatcaccg 
aagtatgtcc 
cgcatggtgg 
gtgatttatg 
gaggagattc 
gtcttggcca 
tctttgaccg 
gacgatctcc 
gaagttgagc 
tctgggttga 
cgacatgact 
ggttcaaaca 
atcgtcatca 
agtccagtgt 
tggaagcatc 
tgctttcatt 
aagcggagac 
aagcagagga 
cacctgtcca 
atgcagaact 



aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
ggtggaattc 
caggaagtca 
gccaagtttg 
agcgacgaag 
tatccagcac 
agatgcagca 
ag 



tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
t gat gage tg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gaeggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
tgeagatate 
gectaaaact 
tttcatgaca 
agctcatcag 
agtggcggcc 
gaaeggctae 



acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caaaaattgg 
atggtgggcg 
aaaaagcttg 
ctggagccag 
gcttgtacca 
aaagecttag 
aacagtcaga 
gcagacgccg 
gaaaatccaa 



ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaagatgga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 
tagatcctag 
attgetattg 
gcatctccta 
ctcatcaagc 
ctgtcacccc 
cctacaagtt 



tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgetaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgeagaatte 
agaagatgtg 
agegacagtg 
cggatccact 
actagagccc 
taaaaagtgt 
tggcaggaag 
ttctctatca 
agaggagege 
ctttgagcag 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2352 



<210> 12 
<211> 783 
<212> PRT 

<213> fusion protein - human 
<400> 12 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu 

15 10 
Ala Leu Glu Val Pro Thr Asp Gly Asn Ala 

20 25 
Gin lie Ala Met Phe Cys Gly Arg Leu Asn 

35 40 
Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly 

50 55 
Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin 
65 70 

Gin lie Thr Asn Val Val Glu Ala Asn Gin 

85 90 
Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys 

100 105 
lie Pro Tyr Arg Cys Leu Val Gly Glu Phe 

115 120 
Val Pro Asp Lys Cys Lys Phe Leu His Gin 

130 ~ 135 

Glu Thr His Leu His Trp His Thr Val Ala 
145 150 
Lys Ser Thr Asn Leu His Asp Tyr Gly Met 
165 170 
Asp Lys Phe Arg Gly Val Glu Phe Val Cys 

180 185 
Ser Asp Asn Val Asp Ser Ala Asp Ala Glu 

195 200 
Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala 

210 215 
Val Val Glu Val Ala Glu Glu Glu Glu Val 



Ala Ala Trp Thr 
Gly Leu 
Met His 



Thr Lys 

60 
Glu Val 
75 

Pro Val 

Thr His 

lie Ser 

Glu Arg 
140 
Lys Glu 
155 

Leu Leu 

Cys Pro 

Glu Asp 

Asp Gly 
220 
Ala Glu 



Leu Ala 

30 
Met Asn 
45 

Thr Cys 



Tyr Pro 

Thr lie 

Pro His 
110 
Asp Ala 
125 

Met Asp 

Thr Cys 

Pro Cys 

Leu Ala 
190 
Asp Ser 
205 

Ser Glu 



Ala Arg 
15 

Glu Pro 

Val Gin 

lie Asp 

Glu Leu 
80 

Gin Asn 
95 

Phe Val 

Leu Leu 

Val Cys 

Ser Glu 
160 
Gly He 
175 

Glu Glu 



Asp Val 
Asp Lys 
Val Glu Glu Glu 
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225 



230 



235 



240 



Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp 


Asp 


Glu 


Asp 


Gly 


Asp 


Glu 


Val 


Glu 


Glu 










245 










250 










255 




Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu 


Glu 


Ala 


Thr 


Glu 


Arg 


Thr 


Thr 


Ser 


He 








260 










265 










270 






Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Glu 


Ser 


Val 


Glu 


Glu 


Val 


Val 


Arg 






275 










280 










285 








Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser 


Thr 


Pro 


Asp 


Ala 


Val 


Asp 


Lys 


Tyr 


Leu 




290 










295 










300 










Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn 


Glu 


His 


Ala 


His 


Phe 


Gin 


Lys 


Ala 


Lys 


305 








310 










315 










320 


Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 


Glu 


Arg 


Met 


Ser 


Gin 


Val 


Met 


Arg 










325 










330 










335 




Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg 


Gin 


Ala 


Lys 


Asn 


Leu 


Pro 


Lys 


Ala 


Asp 






340 










345 










350 






Lys 


Lys 


Ala 


Val 


He 


Gin 


His 


Phe 


Gin 


Glu 


Lys 


Val 


Glu 


Ser 


Leu 


Glu 






355 










360 










365 








Gin 


Glu 


Ala 


Ala 


Asn 


Glu 


Arg 


Gin 


Gin 


Leu 


Val 


Glu 


Thr 


His 


Met 


Ala 




370 










375 










380 










Arg 


Val 


Glu 


Ala 


Met 


Leu 


Asn 


Asp 


Arg 


Arg 


Arg 


Leu 


Ala 


Leu 


Glu 


Asn 


385 










390 










395 










400 


Tyr 


lie 


Thr 


Ala 


Leu 


Gin 


Ala 


Val 


Pro 


Pro 


Arg 


Pro 


Arg 


His 


Val 


Phe 










405 










410 










415 




Asn 


Met 


Leu 


Lys 


Lys 


Tyr 


Val 


Arg 


Ala 


Glu 


Gin 


Lys 


Asp 


Arg 


Gin 


His 








420 










425 










430 






Thr 


Leu 


Lys 


His 


Phe 


Glu 


His 


Val 


Arg 


Met 


Val 


Asp 


Pro 


Lys 


Lys 


Ala 






435 










440 










445 








Ala 


Gin 


He 


Arg 


Ser 


Gin 


Val 


Met 


Thr 


His 


Leu 


Arg 


Val 


He 


Tyr 


Glu 




450 








455 










460 










Arg 


Met 


Asn 


Gin 


Ser 


Leu 


Ser 


Leu 


Leu 


Tyr 


Asn 


Val 


Pro 


Ala 


Val 


Ala 


465 










470 










475 










480 


Glu 


Glu 


He 


Gin 


Asp 


Glu 


Val 


Asp 


Glu 


Leu 


Leu 


Gin 


Lys 


Glu 


Gin 


Asn 










485 










490 










495 




Tyr 


Ser 


Asp 


Asp 


Val 


Leu 


Ala 


Asn 


Met 


He 


Ser 


Glu 


Pro 


Arg 


He 


Ser 








500 










505 










510 






Tyr 


Gly 


Asn 


Asp 


Ala 


Leu 


Met 


Pro 


Ser 


Leu 


Thr 


Glu 


Thr 


Lys 


Thr 


Thr 




515 










520 










525 








Val 


Glu 


Leu 


Leu 


Pro 


Val 


Asn 


Gly 


Glu 


Phe 


Ser 


Leu 


Asp 


Asp 


Leu 


Gin 




530 










535 










540 










Pro 


Trp 


His 


Ser 


Phe 


Gly 


Ala 


Asp 


Ser 


Val 


Pro 


Ala 


Asn 


Thr 


Glu 


Asn 


545 










550 










555 










560 


Glu 


Val 


Glu 


Pro 


Val 


Asp 


Ala 


Arg 


Pro 


Ala 


Ala 


Asp 


Arg 


Gly 


Leu 


Thr 










565 










570 










575 




Thr 


Arg 


Pro 


Gly 


Ser 


Gly 


Leu 


Thr 


Asn 


He 


Lys 


Thr 


Glu 


Glu 


He 


Ser 






580 










585 










590 






Glu 


Val 


Lys 


Met 


Asp 


Ala 


Glu 


Phe 


Arg 


His 


Asp 


Ser 


Gly 


Tyr 


Glu 


Val 






595 










600 










605 








His 


His 


Gin 


Lys 


Leu 


Val 


Pne 


Pne 


Ala 


Glu 


Asp 


Val 


Gly 


Ser 


Asn 


Lys 




610 










615 










620 










Gly 


Ala 


He 


He 


Gly 


Leu 


Met 


Val 


Gly 


Gly 


Val 


Val 


He 


Ala 


Thr 


Val 


625 








630 










635 










640 


lie 


Val 


He 


Thr 


Leu 


Val 


Met 


Leu 


Lys 


Lys 


Lys 


Lys 


Leu 


Gly 


Thr 


Glu 










645 










650 










655 




Leu 


Gly 


Ser 


Thr 


Ser 


Pro 


Val 


Trp 


Trp 


Asn 


Ser 


Ala 


Asp 


lie 


Leu 


Glu 






660 










665 










670 






Pro 


Val 


Asp 


Pro 


Arg 


Leu 


Glu 


Pro 


Trp 


Lys 


His 


Pro 


Gly 


Ser 


Gin 


Pro 






675 










680 










685 








Lys 


Thr 


Ala 


Cys 


Thr 


Asn 


Cys 


Tyr 


Cys 


Lys 


Lys 


Cys 


Cys 


Phe 


His 


Cys 




690 










695 










700 










Gin 


Val 


Cys 


Phe 


Met 


Thr 


Lys 


Ala 


Leu 


Gly 


He 


Ser 


Tyr 


Gly 


Arg 


Lys 


705 








710 










715 










720 


Lys 


Arg 


Arg 


Gin 


Arg 


Arg 


Arg 


Ala 


His 


Gin 


Asn 


Ser 


Gin 


Thr 


His 


Gin 
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725 730 735 

Ala Ser Leu Ser Lys Gin Arg lie Ser Ser Thr Val Ala Ala Ala Asp 

740 745 750 

Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gin Gin Asn 

755 760 765 

Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
770 775 780 



<210> 13 
<211> 2823 
<212> DNA 

<213> fusion protein - human 



<400> 13 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 

ggttcaaaca 

atcgtcatca 

agtccagtgt 

gatatttgcc 

ctgaagaaca 

gcacatctga 

tttcctcgag 

ttgttaacag 

gcttcagtgg 

tcatcggaag 

gatctggccc 

gtggcgatgg 

gattccccgg 

gccgacttcg 

gatatccagc 

aagatgcagc 



gtttggcact 

gtaatgctgg 

acatgaatgt 

ataccaagga 

atgtggtaga 

agtgcaagac 

gtgatgccct 

gcgaaactca 

acttgcatga 

ttgtgtgttg 

atgactcgga 

aagtagtaga 

atgacgagga 

aagccacaga 

aagaggtggt 

tcgagacacc 

aggccaagca 

aagcaaagaa 

tggaatcttt 

ccagagtgga 

ctctgcaggc 

gcgcagaaca 

atcccaagaa 

agcgcatgaa 

aggatgaagt 

acatgattag 

aaacgaaaac 

agccgtggca 

ctgttgatgc 

caaatatcaa 

caggatatga 

aaggtgcaat 

ccttggtgat 

ggtggaattc 

gacttaaaaa 

actgggagtg 

cagaagtgga 

aagaccttga 

gattatttgt 

agactgatat 

agagtagtaa 

ccccgaccga 

cgcatgccga 

gtccgggatt 

agtttgagca 

acagtggcgg 

agaacggcta 



gctcctgctg 

cctgctggct 

ccagaatggg 

aggcatcctg 

agccaaccaa 

ccatccccac 

tctcgttcct 

tcttcactgg 

ctacggcatg 

cccactggct 

tgtctggtgg 

agtagcagag 

cgatgaggat 

gagaaccacc 

tcgagttcct 

tggggatgag 

ccgagagaga 

cttgcctaaa 

ggaacaggaa 

agccatgctc 

tgttcctcct 

gaaggacaga 

agccgctcag 

tcagtctctc 

tgatgagctg 

tgaaccaagg 

caccgtggag 

ttcttttggg 

ccgccctgct 

gacggaggag 

agttcatcat 

cattggactc 

gctgaagaag 

tgcagatatc 

gctcaagtgc 

tcgctactct 

atcaaggcta 

catgattttg 

acaagataat 

gcctctaaca 

caaaggtcaa 

tgtcagcctg 

cgcgctagac 

taccccccac 

gatgtttacc 

ccgcgacgcc 

cgaaaatcca 



gccgcctgga 

gaaccccaga 

aagtgggatt 

cagtattgcc 

ccagtgacca 

tttgtgattc 

gacaagtgca 

cacaccgtcg 

ttgctgccct 

gaagaaagtg 

ggcggagcag 

gaggaagaag 

ggtgatgagg 

agcattgcca 

acaacagcag 

aatgaacatg 

atgtcccagg 

gctgataaga 

gcagccaacg 

aatgaccgcc 

cggcctcgtc 

cagcacaccc 

atccggtccc 

tccctgctct 

cttcagaaag 

atcagttacg 

ctccttcccg 

gctgactctg 

gccgaccgag 

atctctgaag 

caaaaattgg 

atggtgggcg 

aaaaagcttg 

aagctactgt 

tccaaagaaa 

cccaaaacca 

gaaagactgg 

aaaatggatt 

gtgaataaag 

ttgagacagc 

agacagttga 

ggggacgagc 

gatttcgatc 

gactccgccc 

gatgcccttg 

gctgtcaccc 

acctacaagt 



cggctcgggc 

ttgccatgtt 

cagatccatc 

aagaagtcta 

tccagaactg 

cctaccgctg 

aattcttaca 

ccaaagagac 

gcggaattga 

acaatgtgga 

acacagacta 

tggctgaggt 

tagaggaaga 

ccaccaccac 

ccagtacccc 

cccatttcca 

tcatgagaga 

aggcagttat 

agagacagca 

gccgcctggc 

acgtgttcaa 

taaagcattt 

aggttatgac 

acaacgtgcc 

agcaaaacta 

gaaacgatgc 

tgaatggaga 

tgccagccaa 

gactgaccac 

tgaatctaga 

tgttctttgc 

gtgttgtcat 

gtaccgagct 

cttctatcga 

aaccgaagtg 

aaaggtctcc 

aacagctatt 

ctttacagga 

atgccgtcac 

atagaataag 

ctgtatcggg 

tccacttaga 

tggacatgtt 

cctacggcgc 

gaattgacga 

cagaggagcg 

tctttgagca 



gctggaggta 60 
ctgtggcaga 120 
agggaccaaa 180 
ccctgaactg 240 
gtgcaagcgg 300 
cttagttggt 360 
ccaggagagg 420 
atgcagtgag 480 
caagttccga 540 
ttctgctgat 600 
tgcagatggg 660 
ggaagaagaa 720 
ggctgaggaa 780 
caccaccaca 840 
tgatgccgtt 900 
gaaagccaaa 960 
atgggaagag 1020 
ccagcatttc 1080 
gctggtggag 1140 
cctggagaac 12 00 
tatgctaaag 1260 
cgagcatgtg 1320 
acacctccgt 1380 
tgcagtggcc 1440 
ttcagatgac 1500 
tctcatgcca 1560 
gttcagcctg 1620 
cacagaaaac 1680 
tcgaccaggt 1740 
tgcagaattc 1800 
agaagatgtg 1860 
agcgacagtg 1920 
cggatccact 1980 
acaagcatgc 2040 
cgccaagtgt 2100 
gctgactagg 2160 
tctactgatt 2220 
tataaaagca 2280 
agatagattg 2340 
tgcgacatca 2400 
aattcccggg 2460 
cggcgaggac 2520 
gggggacggg 2580 
tctggatatg 2640 
gtacggtggg 2700 
ccacctgtcc 2760 
gatgcagaac 2820 
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Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
930 ~ 935 940 



<210> 15 

<211> 2823 

<212> DNA 

<213> fusion protein 



<400> 15 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 

ggttcaaaca 

atcgtcatca 

agtccagtgt 

gatatttgcc 

ctgaagaaca 

gcacatctga 

tttcctcgag 

ttgttaacag 

gcttcagtgg 

tcatcggaag 

gatctggccc 

gtggcgatgg 

gattccccgg 

gccgacttcg 

gatatccagc 

aagatgcagc 

tag 

<210> 16 
<211> 941 
<212> PRT 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
ggtggaattc 
gacttaaaaa 
actgggagtg 
cagaagtgga 
aagaccttga 
gattatttgt 
agactgatat 
agagtagtaa 
ccccgaccga 
cgcatgccga 
gtccgggatt 
agtttgagca 
acagtggcgg 
agaacggcta 



human 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
tgcagatatc 
gctcaagtgc 
tcgctactct 
atcaaggcta 
catgattttg 
acaagataat 
gcctctaaca 
caaaggtcaa 
tgtcagcctg 
cgcgctagac 
taccccccac 
gatgtttacc 
ccgcgacgcc 
cgaaaatcca 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caaaaattgg 
atggtgggcg 
aaaaagcttg 
aagctactgt 
tccaaagaaa 
cccaaaacca 
gaaagactgg 
aaaatggatt 
gtgaataaag 
ttgagacagc 
agacagttga 
ggggacgagc 
gatttcgatc 
gactccgccc 
gatgcccttg 
gctgtcaccc 
acctacaagt 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaagatgga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 
cttctatcga 
aaccgaagtg 
aaaggtctcc 
aacagctatt 
ctttacagga 
atgccgtcac 
atagaataag 
ctgtatcggg 
tccacttaga 
tggacatgtt 
cctacggcgc 
gaattgacga 
cagaggagcg 
tctttgagca 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgcagaattc 
agaagatgtg 
agcgacagtg 
cggatccact 
acaagcatgc 
cgccaagtgt 
gctgactagg 
tctactgatt 
tataaaagca 
agatagattg 
tgcgacatca 
aattcccggg 
cggcgaggac 
gggggacggg 
tctggatatg 
gtacggtggg 
ccacctgtcc 
gatgcagaac 
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Asp 


Tyr 










165 








Asp 


Lys 


Phe 


Arg 


Gly 


Val 


Glu 


Phe 








180 










Ser 


Asp 


Asn 


Val 


Asp 


Ser 


Ala 


Asp 






195 










200 


Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thr 


Asp 




210 










215 




Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu 


Glu 


225 










230 






Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp 


Asp 










245 








Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu 


Glu 








260 










Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 






275 










280 


Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser 


Thr 




290 










295 




Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn 


Glu 


305 










310 






Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 










325 








Glu 


Trp 


Glu 


Glu 


Ala 


G1U 


Arg 


Gin 








340 










Lys 


Lys 


Ala 


Val 


He 


Gin 


His 


Phe 




355 










360 


Gin 


Glu 


Ala 


Ala 


Asn 


Glu 


Arg 


Gin 




370 










375 




Arg 


Val 


Glu 


Ala 


Met 


Leu 


Asn 


Asp 


385 










390 






Tyr 


He 


Thr 


Ala 


Leu 


Gin 


Ala 


Val 








405 








Asn 


Met 


Leu 


Lys 


Lys 


Tyr 


Val 


Arg 








420 










Thr 


Leu 


Lys 


His 


Phe 


Glu 


His 


Val 






435 










440 


Ala 


Gin 


He 


Arg 


Ser 


Gin 


Val 


Met 




450 










455 




Arg 


Met 


Asn 


Gin 


Ser 


Leu 


Ser 


Leu 



Leu 


Leu 


Ala 


Ala 


Trp 


Thr 




Arg 




10 










13 




Asn Ala 


Gly Leu 


Leu 


Aia 


bill 


Fro 


25 










J U 






Leu 


Asn 


Met 


His 


Met 


Asn 


vai 


Gin 










45 








Ser Gly 


Thr Lys 


Thr 


Cys 


lie 


Asp 








60 










Cys 


Gin 


Glu Val 


Tyr 


Pro 


GIU 


Leu 




75 










oO 


Asn 


Gin 


Pro 


Val 


Thr 


lie 


Gin 


Asn 




90 










3d 




Cys 


Lys 


Thr 


His 


Pro 


His 


Pne 


val 


105 










110 






Glu 


Phe 


He 


Ser 


Asp 


Ala 


Leu 


Leu 










125 








His 


Gin 


Glu 


Arg 


Met 


Asp 


Val 


Cys 








140 










Val 


Ala 


Lys 


Glu 


Thr 


Cys 


Ser 


Glu 






155 










160 


Gly Met 


Leu 


Leu 


Pro 


Cys 


Gly 


lie 




170 










175 




Val 


Cys 


Cys 


Pro 


Leu 


Ala 


Glu 


Glu 


185 










190 






Ala 


Glu 


Glu Asp 


Asp 


Ser 


Asp 


vai 










205 








Tyr Ala 


Asp Gly 


Ser 


Glu 


Asp 


Lys 








220 










Glu 


Val 


Ala 


Glu 


Val 


Glu 


Glu 


Glu 






235 










240 


Glu Asp 


Gly Asp 


Glu 


Val 


Glu 


Glu 




250 










o c c 




Ala 


Thr 


Glu 


Arg 


Thr 


Thr 


Ser 


lie 


265 










270 






Glu 


Ser 


Val 


Glu 


Glu 


Val 


Val 


Arg 










285 








Pro 


Asp 


Ala 


Val 


Asp 


Lys 


Tyr 


Leu 








300 










His 


Ala 


His 


Phe 


Gin 


Lys 


Aia 


Lys 






315 












Glu 


Arg 


Met 


Ser 


Gin 


Val 


Met 


Arg 




330 










O "3 C 
J J J 




Ala 


Lys 


Asn 


Leu 


C ±- KJ 


Lys 


Ala 


Asp 


345 










350 






Gin 


Glu 


Lys 


Val 


Glu 


Ser 


Leu 


Glu 










365 








Gin 


Leu 


Val 


Glu 


Thr 


His 


Met 


Ala 








380 










Arg Arg 


Arg 


Leu 


Ala 


Leu 


Glu 


Asn 






395 










400 


Pro 


Pro 


Arg 


Pro 


Arg 


His 


Val 


Phe 




410 










415 




Ala 


Glu 


Gin 


Lys 


Asp 


Arg 


Gin 


His 


425 










430 






Arg 


Met 


Val 


Asp 


Pro 


Lys 


Lys 


Ala 










445 








Thr 


His 


Leu 


Arg 


Val 


He 


Tyr 


Glu 








460 










Leu 


Tyr 


Asn 


Val 


Pro 


Ala 


Val 


Ala 
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465 470 



Glu 


Glu 


lie 


Gin 


Asp 


Glu Val 


Asp Glu 










485 








Tyr 


Ser 


Asp Asp 


Val 


Leu Ala 


Asn 


Met 








500 








505 


Tyr 


Gly 


Asn Asp 


Ala 


Leu Met 


Pro 


Ser 






515 








520 




Val 


Glu 


Leu 


Leu 


Pro Val Asn 


Gly Glu 




530 








535 






Pro 


Trp 


His 


Ser 


Phe Gly Ala 


Asp 


Ser 


545 










550 






Glu 


Val 


Glu 


Pro 


Val Asp Ala 


Arg 


Pro 










565 








Thr 


Arg 


Pro Gly 


Ser Gly Leu 


Thr 


Asn 








580 








585 


Glu 


Val 


Lys 


Met 


Asp 


Ala Glu 


Phe 


Arg 






595 








600 




His 


His 


Gin 


Lys 


Leu 


Val Phe 


Phe 


Ala 




610 








615 






Gly 


Ala 


He 


He 


Gly Leu Met 


Val 


Gly 


625 










630 






lie 


Val 


He 


Thr 


Leu 


Val Met 


Leu Lys 










645 








Leu 


Gly 


Ser 


Thr 


Ser 


Pro Val 


Trp 


Trp 








660 








665 


Leu 


Ser 


Ser 


He 


Glu 


Gin Ala 


Cys 


Asp 






675 








680 




Lys 


Cys 


Ser 


Lys 


Glu 


Lys Pro 


Lys 


Cys 




690 








695 






Trp 


Glu 


Cys 


Arg 


Tyr 


Ser Pro 


Lys 


Thr 


705 










710 






Ala 


His 


Leu 


Thr 


Glu 


Val Glu 


Ser 


Arg 










725 








Phe 


Leu 


Leu 


He 


Phe 


Pro Arg 


Glu Asp 








740 








745 


Asp 


Ser 


Leu 


Gin 


Asp 


He Lys 


Ala 


Leu 






755 








760 




Asp 


Asn 


Val 


Asn 


Lys Asp Ala 


Val 


Thr 




770 








775 






Thr 


Asp 


Met 


Pro 


Leu 


Thr Leu 


Arg 


Gin 


785 










790 






Ser 


Ser 


Glu 


Glu 


Ser 


Ser Asn 


Lys 


Gly 










805 








Gly 


lie 


Pro 


Gly 


Asp 


Leu Ala 


Pro 


Pro 








820 








825 


Glu 


Leu 


His 


Leu 


Asp 


Gly Glu 


Asp 


Val 






835 








840 




Leu 


Asp 


Asp 


Phe 


Asp 


Leu Asp 


Met 


Leu 




850 








855 






Pro 


Gly 


Phe 


Thr 


Pro 


His Asp 


Ser 


Ala 


865 










870 






Ala 


Asp 


Phe 


Glu 


Phe 


Glu Gin 


Met 


Phe 










885 








Glu 


Tyr 


Gly Gly 


Asp 


He Gin 


His 


Ser 








900 








905 


Val 


Thr 


Pro 


Glu 


Glu 


Arg His 


Leu 


Ser 






915 








920 




Glu 


Asn 


Pro 


Thr 


Tyr Lys Phe 


Phe 


Glu 




930 








935 
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475 










480 


Leu Leu 


Gin 


Lys 


Glu 


Gin 


Asn 


490 








495 




He Ser 


Glu 


Pro 


Arg 


He 


Ser 








510 






Leu Thr 


Glu 


Thr 


Lys 


Thr 


Thr 






525 






Phe Ser 


Leu 


Asp 


Asp 


Leu 


Gin 




540 










Val Pro 


Ala 


Asn 


Thr 


Glu 


Asn 


555 










560 


Ala Ala 


Asp 


Arg 


Gly 


Leu 


Thr 


570 








575 




He Lys 


Thr 


Glu 


Glu 


He 


Ser 








590 






His Asp 


Ser 


Gly 


Tyr 


Glu 


Val 






605 








Glu Asp Val 


Gly 


Ser 


Asn 


Lys 




620 










Gly Val 


Val 


He 


Ala 


Thr 


Val 


635 










640 


Lys Lys 


Lys 


Leu 


Gly 


Thr 


Glu 


650 








655 




Asn Ser 


Ala 


Asp 


He 


Lys 


Leu 








670 






He Cys 


Arg 


Leu 


Lys 


Lys 


Leu 






685 








Ala Lys 


Cys 


Leu 


Lys 


Asn 


Asn 




700 










Lys Arg 


Ser 


Pro 


Leu 


Thr 


Arg 


715 










720 


Leu Glu 


Arg 


Leu 


Glu 


Gin 


Leu 


730 








735 




Leu Asp 


Met 


He 


Leu 


Lys 


Met 








750 






Leu Thr Gly 


Leu 


Phe 


Val 


Gin 






765 








Asp Arg 


Leu 


Ala 


Ser 


Val 


Glu 




780 










His Arg 


He 


Ser 


Ala 


Thr 


Ser 


795 










800 


Gin Arg 


Gin 


Leu 


Thr 


Val 


Ser 


810 








815 




Thr Asp Val 


Ser 


Leu 


Gly 


Asp 








830 






Ala Met 


Ala 


His 


Ala 


Asp 


Ala 






845 








Gly Asp Gly 


Asp 


Ser 


Pro 


Gly 




860 










Pro Tyr 


Gly 


Ala 


Leu 


Asp 


Met 


875 










880 


Thr Asp Ala 


Leu 


Gly 


He 


Asp 


890 








895 




Gly Ala Ala 


Ala 


Asp 


Ala 


Ala 








910 






Lys Met 


Gin 


Gin 


Asn 


Gly 


Tyr 






925 








Gin Met 


Gin 


Asn 









940 
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<211> 3354 
<212> DNA 
<213> human 

<400> 17 

agtttcctcg gcagcggtag gcgagagcac gcggaggagc gtgcgcgggg gccccgggag 60 
acggcggcgg tggcggcgcg ggcagagcaa ggacgcggcg gatcccactc gcacagcagc 120 
gcactcggtg ccccgcgcag ggtcgcgatg ctgcccggtt tggcactgct cctgctggcc 180 
gcctggacgg ctcgggcgct ggaggtaccc actgatggta atgctggcct gctggctgaa 240 
ccccagattg ccatgttctg tggcagactg aacatgcaca tgaatgtcca gaatgggaag 300 
tgggattcag atccatcagg gaccaaaacc tgcattgata ccaaggaagg catcctgcag 3 60 
tattgccaag aagtctaccc tgaactgcag atcaccaatg tggtagaagc caaccaacca 420 
gtgaccatcc agaactggtg caagcggggc cgcaagcagt gcaagaccca tccccacttt 480 
gtgattccct accgctgctt agttggtgag tttgtaagtg atgcccttct cgttcctgac 540 
aagtgcaaat tcttacacca ggagaggatg gatgtttgcg aaactcatct tcactggcac 600 
accgtcgcca aagagacatg cagtgagaag agtaccaact tgcatgacta cggcatgttg 660 
ctgccctgcg gaattgacaa gttccgaggg gtagagtttg tgtgttgccc actggctgaa 720 
gaaagtgaca atgtggattc tgctgatgcg gaggaggatg actcggatgt ctggtggggc 780 
ggagcagaca cagactatgc agatgggagt gaagacaaag tagtagaagt agcagaggag 840 
gaagaagtgg ctgaggtgga agaagaagaa gccgatgatg acgaggacga tgaggatggt 900 
gatgaggtag aggaagaggc tgaggaaccc tacgaagaag ccacagagag aaccaccagc 960 
attgccacca ccaccaccac caccacagag tctgtggaag aggtggttcg agttcctaca 1020 
acagcagcca gtacccctga tgccgttgac aagtatctcg agacacctgg ggatgagaat 1080 
gaacatgccc atttccagaa agccaaagag aggcttgagg ccaagcaccg agagagaatg 1140 
tcccaggtca tgagagaatg ggaagaggca gaacgtcaag caaagaactt gcctaaagct 1200 
gataagaagg cagttatcca gcatttccag gagaaagtgg aatctttgga acaggaagca 12 60 
gccaacgaga gacagcagct ggtggagaca cacatggcca gagtggaagc catgctcaat 1320 
gaccgccgcc gcctggccct ggagaactac atcaccgctc tgcaggctgt tcctcctcgg 1380 
cctcgtcacg tgttcaatat gctaaagaag tatgtccgcg cagaacagaa ggacagacag 1440 
cacaccctaa agcatttcga gcatgtgcgc atggtggatc ccaagaaagc cgctcagatc 1500 
cggtcccagg ttatgacaca cctccgtgtg atttatgagc gcatgaatca gtctctctcc 1560 
ctgctctaca acgtgcctgc agtggccgag gagattcagg atgaagttga tgagctgctt 1620 
cagaaagagc aaaactattc agatgacgtc ttggccaaca tgattagtga accaaggatc 1680 
agttacggaa acgatgctct catgccatct ttgaccgaaa cgaaaaccac cgtggagctc 1740 
cttcccgtga atggagagtt cagcctggac gatctccagc cgtggcattc ttttggggct 1800 
gactctgtgc cagccaacac agaaaacgaa gttgagcctg ttgatgcccg ccctgctgcc 1860 
gaccgaggac tgaccactcg accaggttct gggttgacaa atatcaagac ggaggagatc 1920 
tctgaagtga agatggatgc agaattccga catgactcag gatatgaagt tcatcatcaa 1980 
aaattggtgt tctttgcaga agatgtgggt tcaaacaaag gtgcaatcat tggactcatg 2040 
gtgggcggtg ttgtcatagc gacagtgatc gtcatcacct tggtgatgct gaagaagaaa 2100 
cagtacacat ccattcatca tggtgtggtg gaggttgacg ccgctgtcac cccagaggag 2160 
cgccacctgt ccaagatgca gcagaacggc tacgaaaatc caacctacaa gttctttgag 2220 
cagatgcaga actagacccc cgccacagca gcctctgaag ttggacagca aaaccattgc 2280 
ttcactaccc atcggtgtcc atttatagaa taatgtggga agaaacaaac ccgttttatg 2340 
atttactcat tatcgccttt tgacagctgt gctgtaacac aagtagatgc ctgaacttga 2400 
attaatccac acatcagtaa tgtattctat ctctctttac attttggtct ctatactaca 2460 
ttattaatgg gttttgtgta ctgtaaagaa tttagctgta tcaaactagt gcatgaatag 2520 
attctctcct gattatttat cacatagccc cttagccagt tgtatattat tcttgtggtt 2580 
tgtgacccaa ttaagtccta ctttacatat gctttaagaa tcgatggggg atgcttcatg 2640 
tgaacgtggg agttcagctg cttctcttgc ctaagtattc ctttcctgat cactatgcat 2700 
tttaaagtta aacattttta agtatttcag atgctttaga gagatttttt ttccatgact 2760 
gcattttact gtacagattg ctgcttctgc tatatttgtg atataggaat taagaggata 2820 
cacacgtttg tttcttcgtg cctgttttat gtgcacacat taggcattga gacttcaagc 2880 
ttttcttttt ttgtccacgt atctttgggt ctttgataaa gaaaagaatc cctgttcatt 2940 
gtaagcactt ttacggggcg ggtggggagg ggtgctctgc tggtcttcaa ttaccaagaa 3000 
ttctccaaaa caattttctg caggatgatt gtacagaatc attgcttatg acatgatcgc 3060 
tttctacact gtattacata aataaattaa ataaaataac cccgggcaag acttttcttt 3120 
gaaggatgac tacagacatt aaataatcga agtaattttg ggtggggaga agaggcagat 3180 
tcaattttct ttaaccagtc tgaagtttca tttatgatac aaaagaagat gaaaatggaa 3240 
gtggcaatat aaggggatga ggaaggcatg cctggacaaa cccttctttt aagatgtgtc 3300 
ttcaatttgt ataaaatggt gttttcatgt aaataaatac attcttggag gage 3354 
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<210> 18 
<211> 695 
<212> PRT 
<213> human 



<400> 18 



Met 


Leu 


Pro 


Gly 


Leu 


Ala 


Leu Leu Leu Leu Ala Ala 


Trp 


Thr 


Ala 


Arg 


1 








5 




10 




15 


Ala 


Leu 


Glu 


Val 


Pro 


Thr 


Asp Gly Asn Ala Gly Leu 


Leu 


Ala 


Glu 


Pro 








20 






25 




30 






Gin 


He 


Ala 


Met 


Phe 


Cys 


Gly Arg Leu Asn Met His 


Met 


Asn 


Val 


Gin 






35 








40 


45 








Asn 


Gly 


Lys 


Trp 


Asp 


Ser 


Asp Pro Ser Gly Thr Lys 


Thr 


Cys 


He 


Asp 




50 










55 60 










Thr 


Lys 


Glu 


Gly 


He 


Leu 


Gin Tyr Cys Gin Glu Val 


Tyr 


Pro 


Glu 


Leu 


65 










70 


75 








80 


Gin 


He 


Thr 


Asn 


Val 


Val 


Glu Ala Asn Gin Pro Val 


Thr 


He 


Gin 


Asn 










85 




90 






95 




Trp 


Cys 


Lys 


Arg 


Gly 


Arg 


Lys Gin Cys Lys Thr His 


Pro 


His 


Phe 


Val 








100 






105 




110 






He 


Pro 


Tyr 


Arg 


Cys 


Leu 


Val Gly Glu Phe Val Ser 


Asp 


Ala 


Leu 


Leu 






115 








120 


125 








Val 


Pro 


Asp 


Lys 


Cys 


Lys 


Phe Leu His Gin Glu Arg 


Met 


Asp 


Val 


Cys 




130 










135 140 










Glu 


Thr 


His 


Leu 


His 


Trp 


His Thr Val Ala Lys Glu 


Thr 


Cys 


Ser 


Glu 


145 










150 


155 








160 


Lys 


Ser 


Thr 


Asn 


Leu 


His 


Asp Tyr Gly Met Leu Leu 


Pro 


Cys 


Gly 


He 










165 




170 






175 




Asp 


Lys 


Phe 


Arg 


Gly 


Val 


Glu Phe Val Cys Cys Pro 


Leu 


Ala 


Glu 


Glu 








180 






185 




190 






Ser 


Asp 


Asn 


Val 


Asp 


Ser 


Ala Asp Ala Glu Glu Asp 


Asp 


Ser 


Asp 


Val 






195 








200 


205 








Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thr Asp Tyr Ala Asp Gly 


Ser 


Glu 


Asp 


Lys 




210 










215 220 










Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu Glu Glu Val Ala Glu 


Val 


Glu 


Glu 


Glu 


225 










230 


235 








240 


Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp Asp Glu Asp Gly Asp 


Glu 


Val 


Glu 


Glu 










245 




250 






255 




Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu Glu Ala Thr Glu Arg 


Thr 


Thr 


Ser 


He 








260 






265 




270 






Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr Thr Glu Ser Val Glu 


Glu 


Val 


Val 


Arg 






275 








280 


285 








Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser Thr Pro Asp Ala Val 


Asp 


Lys 


Tyr 


Leu 




290 










295 300 










Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn Glu His Ala His Phe 


Gin 


Lys 


Ala 


Lys 


305 










310 


315 






320 


Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His Arg Glu Arg Met Ser 


Gin 


Val 


Met 


Arg 










325 




330 






335 




Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg Gin Ala Lys Asn Leu 


Pro 


Lys 


Ala 


Asp 








340 






345 




350 






Lys 


Lys 


Ala 


Val 


He 


Gin 


His Phe Gin Glu Lys Val 


Glu 


Ser 


Leu 


Glu 






355 








360 


365 








Gin 


Glu 


Ala 


Ala 


Asn 


Glu 


Arg Gin Gin Leu Val Glu 


Thr 


His 


Met 


Ala 




370 










375 380 










Arg 


Val 


Glu 


Ala 


Met 


Leu 


Asn Asp Arg Arg Arg Leu 


Ala 


Leu 


Glu 


Asn 


385 










390 


395 








400 


Tyr 


He 


Thr 


Ala 


Leu 


Gin 


Ala Val Pro Pro Arg Pro 


Arg 


His 


Val 


Phe 










405 




410 






415 




Asn 


Met 


Leu 


Lys 


Lys 


Tyr 


Val Arg Ala Glu Gin Lys 


Asp 


Arg 


Gin 


His 








420 






425 




430 






Thr 


Leu 


Lys 


His 


Phe 


Glu 


His Val Arg Met Val Asp 


Pro 


Lys 


Lys 


Ala 






435 








440 


445 
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Ala Gin lie Arg Ser Gin Val Met Thr His Leu Arg Val lie Tyr Glu 

450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu lie Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 

485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met lie Ser Glu Pro Arg lie Ser 

500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 

515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 

530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 

565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn lie Lys Thr Glu Glu He Ser 

580 585 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 

595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 

610 615 620 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 

645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 

660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 

675 680 685 

Phe Phe Glu Gin Met Gin Asn 
690 695 

<210> 19 
<211> 3972 
<212> DNA 
<213> human 

<400> 19 

atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60 
ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120 
ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180 
ctcgtgacca ccttcaccta cggcgtgcag tgcttcgccc gctaccccga ccacatgaag 240 
cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300 
ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360 
gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420 
aagctggagt acaactacaa cagccacaag gtctatatca ccgccgacaa gcagaagaac 480 
ggcatcaagg tgaacttcaa gacccgccac aacatcgagg acggcagcgt gcagctcgcc 540 
gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600 
tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660 
ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaagtaa 720 
ctcgagtcta gagggcccgt ttaaacccgc tgatcagcct cgactgtgcc ttctagttgc 780 
cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc 840 
actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct 900 
attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg 960 
catgctgggg atgcggtggg ctctatggct tctgaggcgg aaagaaccag catgtgagca 1020 
aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 1080 
ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 1140 
acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 1200 
ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 1260 
tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 1320 
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tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 1380 
gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 1440 
agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 1500 
tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 1560 
agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 1620 
tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 1680 
acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgacatta 1740 
acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg tgatgacggt 1800 
gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc 1860 
gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt 1920 
aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg 1980 
cacagatgcg taaggagaaa ataccgcatc aggacgcgcc ctgtagcggc gcattaagcg 2040 
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 2100 
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 2160 
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 2220 
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 2280 
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 2340 
tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt 2400 
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc 2460 
ttacaatttc cattcgccat tcaggctgaa ctagatctag agtccgttac ataacttacg 2520 
gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 2580 
tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 2640 
cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 2700 
gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 2760 
tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 2820 
tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 2 880 
cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 2940 
cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgfcacggtg ggaggtctat 3000 
ataagcagag ctcgtttagt gaaccgtcag atcgcctgga gacgccatcc acgctgtttt 3060 
gacctccata gaagacaccg ggaccgatcc agcctccgcg gccgggaacg gtgcattgga 3120 
acggaccgtg ttgacaatta atcatcggca tagtatatcg gcatagtata atacgacaag 3180 
gtgaggaact aaaccatggc caagttgacc agtgccgttc cggtgctcac cgcgcgcgac 3240 
gtcgccggag cggtcgagtt ctggaccgac cggctcgggt tctcccggga cttcgtggag 3300 
gacgacttcg ccggtgtggt ccgggacgac gtgaccctgt tcatcagcgc ggtccaggac 3360 
caggtggtgc cggacaacac cctggcctgg gtgtgggtgc gcggcctgga cgagctgtac 3420 
gccgagtggt cggaggtcgt gtccacgaac ttccgggacg cctccgggcc ggccatgacc 3480 
gagatcggcg agcagccgtg ggggcgggag ttcgccctgc gcgacccggc cggcaactgc 3540 
gtgcacttcg tggccgagga gcaggactga cactcgacct cgaaacttgt ttattgcagc 3 600 
ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc 3 660 
actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggatcga 3720 
tacttcaaga actgctgaca tcgagcttgc tacaagggac tttccgctgg ggactttcca 3780 
gggaggcgtg gcctgggcgg gactggggag tggcgagccc tcagatcctg catataagca 3840 
gctgcttttt gcctgtactg ggtctctctg gttagaccag atctgagcct gggagctctc 3900 
tggctaacta gggaacccac tgcttaagcc tcaataaagc ttggtaccga gctcggatcc .3960 
gaattcgcca cc 3972 

<210> 20 
<211> 3972 
<212> DNA 
<213> human 



taccactcgt tcccgctcct cgacaagtgg ccccaccacg ggtaggacca gctcgacctg 60 
ccgctgcatt tgccggtgtt caagtcgcac aggccgctcc cgctcccgct acggtggatg 120 
ccgttcgact gggacttcaa gtagacgtgg tggccgttcg acgggcacgg gaccgggtgg 180 
gagcactggt ggaagtggat gccgcacgtc acgaagcggg cgatggggct ggtgtacttc 240 
gtcgtgctga agaagttcag gcggtacggg cttccgatgc aggtcctcgc gtggtagaag 300 
aagttcctgc tgccgttgat gttctgggcg cggctccact tcaagctccc gctgtgggac 360 
cacttggcgt agctcgactt cccgtagctg aagttcctcc tgccgttgta ggaccccgtg 420 
ttcgacctca tgttgatgtt gtcggtgttc cagatatagt ggcggctgtt cgtcttcttg 480 
ccgtagttcc acttgaagtt ctgggcggtg ttgtagctcc tgccgtcgca cgtcgagcgg 540 
ctggtgatgg tcgtcttgtg ggggtagccg ctgccggggc acgacgacgg gctgttggtg 600 
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atggactcgt gggtcaggcg ggactcgttt ctggggttgc tcttcgcgct agtgtaccag 660 
gacgacctca agcactggcg gcggccctag tgagagccgt acctgctcga catgttcatt 720 
gagctcagat ctcccgggca aatttgggcg actagtcgga gctgacacgg aagatcaacg 780 
gtcggtagac aacaaacggg gagggggcac ggaaggaact gggaccttcc acggtgaggg 840 
tgacaggaaa ggattatttt actcctttaa cgtagcgtaa cagactcatc cacagtaaga 900 
taagaccccc caccccaccc cgtcctgtcg ttccccctcc taacccttct gttatcgtcc 960 
gtacgacccc tacgccaccc gagataccga agactccgcc tttcttggtc gtacactcgt 1020 
tttccggtcg ttttccggtc cttggcattt ttccggcgca acgaccgcaa aaaggtatcc 1080 
gaggcggggg gactgctcgt agtgttttta gctgcgagtt cagtctccac cgctttgggc 1140 
tgtcctgata tttctatggt ccgcaaaggg ggaccttcga gggagcacgc gagaggacaa 1200 
ggctgggacg gcgaatggcc tatggacagg cggaaagagg gaagcccttc gcaccgcgaa 1260 
agagtatcga gtgcgacatc catagagtca agccacatcc agcaagcgag gttcgacccg 132 0 
acacacgtgc ttggggggca agtcgggctg gcgacgcgga ataggccatt gatagcagaa 1380 
ctcaggttgg gccattctgt gctgaatagc ggtgaccgtc gtcggtgacc attgtcctaa 1440 
tcgtctcgct ccatacatcc gccacgatgt ctcaagaact tcaccaccgg attgatgccg 1500 
atgtgatctt cttgtcataa accatagacg cgagacgact tcggtcaatg gaagcctttt 1560 
tctcaaccat cgagaactag gccgtttgtt tggtggcgac catcgccacc aaaaaaacaa 1620 
acgttcgtcg tctaatgcgc gtcttttttt cctagagttc ttctaggaaa ctagaaaaga 1680 
tgccccagac tgcgagtcac cttgcttttg agtgcaattc cctaaaacca gtactgtaat 1740 
tggatatttt tatccgcata gtgctccggg aaagcagagc gcgcaaagcc actactgcca 1800 
cttttggaga ctgtgtacgt cgagggcctc tgccagtgtc gaacagacat tcgcctacgg 1860 
ccctcgtctg ttcgggcagt cccgcgcagt cgcccacaac cgcccacagc cccgaccgaa 1920 
ttgatacgcc gtagtctcgt ctaacatgac tctcacgtgg tatacgccac actttatggc 1980 
gtgtctacgc attcctcttt tatggcgtag tcctgcgcgg gacatcgccg cgtaattcgc 2040 
gccgcccaca ccaccaatgc gcgtcgcact ggcgatgtga acggtcgcgg gatcgcgggc 2100 
gaggaaagcg aaagaaggga aggaaagagc ggtgcaagcg gccgaaaggg gcagttcgag 2160 
atttagcccc cgagggaaat cccaaggcta aatcacgaaa tgccgtggag ctggggtttt 222 0 
ttgaactaat cccactacca agtgcatcac ccggtagcgg gactatctgc caaaaagcgg 2280 
gaaactgcaa cctcaggtgc aagaaattat cacctgagaa caaggtttga ccttgttgtg 2340 
agttgggata gagccagata agaaaactaa atattcccta aaacggctaa agccggataa 2400 
ccaatttttt actcgactaa attgttttta aattgcgctt aaaattgttt tataattgcg 2460 
aatgttaaag gtaagcggta agtccgactt gatctagatc tcaggcaatg tattgaatgc 2520 
catttaccgg gcggaccgac tggcgggttg ctgggggcgg gtaactgcag ttattactgc 2580 
atacaagggt atcattgcgg ttatccctga aaggtaactg cagttaccca cctcataaat 2640 
gccatttgac gggtgaaccg tcatgtagtt cacatagtat acggttcatg cgggggataa 2700 
ctgcagttac tgccatttac cgggcggacc gtaatacggg tcatgtactg gaataccctg 2760 
aaaggatgaa ccgtcatgta gatgcataat cagtagcgat aatggtacca ctacgccaaa 2820 
accgtcatgt agttacccgc acctatcgcc aaactgagtg cccctaaagg ttcagaggtg 2880 
gggtaactgc agttaccctc aaacaaaacc gtggttttag ttgccctgaa aggttttaca 2940 
gcattgttga ggcggggtaa ctgcgtttac ccgccatccg cacatgccac cctccagata 3000 
tattcgtctc gagcaaatca cttggcagtc tagcggacct ctgcggtagg tgcgacaaaa 3060 
ctggaggtat cttctgtggc cctggctagg tcggaggcgc cggcccttgc cacgtaacct 3120 
tgcctggcac aactgttaat tagtagccgt atcatatagc cgtatcatat tatgctgttc 3180 
cactccttga tttggtaccg gttcaactgg tcacggcaag gccacgagtg gcgcgcgctg 3240 
cagcggcctc gccagctcaa gacctggctg gccgagccca agagggccct gaagcacctc 3300 
ctgctgaagc ggccacacca ggccctgctg cactgggaca agtagtcgcg ccaggtcctg 33 60 
gtccaccacg gcctgttgtg ggaccggacc cacacccacg cgccggacct gctcgacatg 3420 
cggctcacca gcctccagca caggtgcttg aaggccctgc ggaggcccgg ccggtactgg 3480 
ctctagccgc tcgtcggcac ccccgccctc aagcgggacg cgctgggccg gccgttgacg 3540 
cacgtgaagc accggctcct cgtcctgact gtgagctgga gctttgaaca aataacgtcg 3600 
aatattacca atgtttattt cgttatcgta gtgtttaaag tgtttatttc gtaaaaaaag 3 660 
tgacgtaaga tcaacaccaa acaggtttga gtagttacat agaatagtac agacctagct 3720 
atgaagttct tgacgactgt agctcgaacg atgttccctg aaaggcgacc cctgaaaggt 3780 
ccctccgcac cggacccgcc ctgacccctc accgctcggg agtctaggac gtatattcgt 3840 
cgacgaaaaa cggacatgac ccagagagac caatctggtc tagactcgga ccctcgagag 3900 
accgattgat cccttgggtg acgaattcgg agttatttcg aaccatggct cgagcctagg 3960 
cttaagcggt gg 3972 

<210> 21 
<211> 7153 
<212> DNA 
<213> human 
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<400> 21 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 

ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 

cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 

ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 

gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 

tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 

cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 

attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 

atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 

actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 72 0 

aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 

gtttaaactt aagcttcccc gcgcagggtc gcgatgctgc ccggtttggc actgctcctg 960 

ctggccgcct ggacggctcg ggcgctggag gtacccactg atggtaatgc tggcctgctg 1020 

gctgaacccc agattgccat gttctgtggc agactgaaca tgcacatgaa tgtccagaat 10 80 

gggaagtggg attcagatcc atcagggacc aaaacctgca ttgataccaa ggaaggcatc 1140 

ctgcagtatt gccaagaagt ctaccctgaa ctgcagatca ccaatgtggt agaagccaac 1200 

caaccagtga ccatccagaa ctggtgcaag cggggccgca agcagtgcaa gacccatccc 1260 

cactttgtga ttccctaccg ctgcttagtt ggtgagttta taagtgatgc ccttctcgtt 1320 

cctgacaagt gcaaattctt acaccaggag aggatggatg tttgcgaaac tcatcttcac 1380 

tggcacaccg tcgccaaaga gacatgcagt gagaagagta ccaacttgca tgactacggc 1440 

atgttgctgc cctgcggaat tgacaagttc cgaggggtag agtttgtgtg ttgcccactg 1500 

gctgaagaaa gtgacaatgt ggattctgct gatgcggagg aggatgactc ggatgtctgg 1560 

tggggcggag cagacacaga ctatgcagat gggagtgaag acaaagtagt agaagtagca 1620 

gaggaggaag aagtggctga ggtggaagaa gaagaagccg atgatgacga ggacgatgag 1680 

gatggtgatg aggtagagga agaggctgag gaaccctacg aagaagccac agagagaacc 1740 

accagcattg ccaccaccac caccaccacc acagagtctg tggaagaggt ggttcgagtt 1800 

cctacaacag cagccagtac ccctgatgcc gttgacaagt atctcgagac acctggggat 1860 

gagaatgaac atgcccattt ccagaaagcc aaagagaggc ttgaggccaa gcaccgagag 1920 

agaatgtccc aggtcatgag agaatgggaa gaggcagaac gtcaagcaaa gaacttgcct 1980 

aaagctgata agaaggcagt tatccagcat ttccaggaga aagtggaatc tttggaacag 2040 

gaagcagcca acgagagaca gcagctggtg gagacacaca tggccagagt ggaagccatg 2100 

ctcaatgacc gccgccgcct ggccctggag aactacatca ccgctctgca ggctgttcct 2160 

cctcggcctc gtcacgtgtt caatatgcta aagaagtatg tccgcgcaga acagaaggac 2220 

agacagcaca ccctaaagca tttcgagcat gtgcgcatgg tggatcccaa gaaagccgct 2280 

cagatccggt cccaggttat gacacacctc cgtgtgattt atgagcgcat gaatcagtct 2340 

ctctccctgc tctacaacgt gcctgcagtg gccgaggaga ttcaggatga agttgatgag 2400 

ctgcttcaga aagagcaaaa ctattcagat gacgtcttgg ccaacatgat tagtgaacca 2460 

aggatcagtt acggaaacga tgctctcatg ccatctttga ccgaaacgaa aaccaccgtg 2520 

gagctccttc ccgtgaatgg agagttcagc ctggacgatc tccagccgtg gcattctttt 2580 

ggggctgact ctgtgccagc caacacagaa aacgaagttg agcctgttga tgcccgccct 2640 

gctgccgacc gaggactgac cactcgacca ggttctgggt tgacaaatat caagacggag 2700 

gagatctctg aagtgaatct agatgcagaa ttccgacatg actcaggata tgaagttcat 2760 

catcaaaaat tggtgttctt tgcagaagat gtgggttcaa acaaaggtgc aatcattgga 2820 

ctcatggtgg gcggtgttgt catagcgaca gtgatcgtca tcaccttggt gatgctgaag 2880 

aagaaagata tcatggagcc agtagatcct agactagagc cctggaagca tccaggaagt 2940 

cagcctaaaa ctgcttgtac caattgctat tgtaaaaagt gttgctttca ttgccaagtt 3000 

tgtttcatga caaaagcctt aggcatctcc tatggcagga agaagcggag acagcgacga 3060 

agagctcatc agaacagtca gactcatcaa gcttctctat caaagcagta agtaggcggc 3120 

cgctcgagtc tagagggccc gtttaaaccc gctgatcagc ctcgactgtg ccttctagtt 3180 

gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc 3240 

ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt 3300 

ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa gacaatagca 3360 

ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc agctggggct 3420 

ctagggggta tccccacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta 3480 

cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc 3540 

cttcctttct cgccacgttc gccggctttc cccgtcaagc tctaaatcgg ggcatccctt 3600 
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tagggttccg atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg 3660 

gttcacgtag tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca 3720 

cgttctttaa tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct 3780 

attcttttga tttataaggg attttgggga tttcggccta ttggttaaaa aatgagctga 3840 

tttaacaaaa atttaacgcg aattaattct gtggaatgtg tgtcagttag ggtgtggaaa 3900 

gtccccaggc tccccaggca ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa 3960 

ccaggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc atgcatctca 4020 

attagtcagc aaccatagtc ccgcccctaa ctccgcccat cccgccccta actccgccca 4080 

gttccgccca ttctccgccc catggctgac taattttttt tatttatgca gaggccgagg 4140 

ccgcctctgc ctctgagcta ttccagaagt agtgaggagg cttttttgga ggcctaggct 4200 

tttgcaaaaa gctcccggga gcttgtatat ccattttcgg atctgatcag cacgtgttga 4260 

caattaatca tcggcatagt atatcggcat agtataatac gacaaggtga ggaactaaac 4320 

catggccaag ttgaccagtg ccgttccggt gctcaccgcg cgcgacgtcg ccggagcggt 4380 

cgagttctgg accgaccggc tcgggttctc ccgggacttc gtggaggacg acttcgccgg 4440 

tgtggtccgg gacgacgtga ccctgttcat cagcgcggtc caggaccagg tggtgccgga 4500 

caacaccctg gcctgggtgt gggtgcgcgg cctggacgag ctgtacgccg agtggtcgga 4560 

ggtcgtgtcc acgaacttcc gggacgcctc cgggccggcc atgaccgaga tcggcgagca 4620 

gccgtggggg cgggagttcg ccctgcgcga cccggccggc aactgcgtgc acttcgtggc 4680 

cgaggagcag gactgacacg tgctacgaga tttcgattcc accgccgcct tctatgaaag 4740 

gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 4800 

catgctggag ttcttcgccc accccaactt gtttattgca gcttataatg gttacaaata 4860 

aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4920 

tttgtccaaa ctcatcaatg tatcttatca tgtctgtata ccgtcgacct ctagctagag 4980 

cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc 5040 

acacaacata cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta 5100 

actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca 5160 

gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc 5220 

cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 5280 

tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 5340 

gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 5400 

ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 5460 

aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 5520 

tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 5580 

ggcgctttct caatgctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa 5640 

gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 5700 

tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 5760 

caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 5820 

ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt 5880 

cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 5940 

ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 6000 

cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat 6060 

gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc 6120 

aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc 6180 

acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta 6240 

gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga 6300 

cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg 6360 

cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc 6420 

tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat 6480 

cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag 6540 

gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat 6600 

cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa 6660 

ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa 6720 

gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga 6780 

taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg 6840 

gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc 6900 

acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg 6960 

aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact 7020 

cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat 7080 

atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt 7140 

gccacctgac gtc 7153 
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<210> 22 
<211> 7153 
<212> DNA 
<213> human 



<400> 22 

ctgcctagcc 

ggcgtatcaa 

gctcgtttta 

aatcccaatc 

ctaataactg 

acctcaaggc 

gggcgggtaa 

taactgcagt 

tagtatacgg 

tacgggtcat 

agcgataatg 

tgagtgcccc 

ttttagttgc 

catccgcaca 

gacgaatgac 

caaatttgaa 

gaccggcgga 

cgacttgggg 

cccttcaccc 

gacgtcataa 

gttggtcact 

gtgaaacact 

ggactgttca 

accgtgtggc 

tacaacgacg 

cgacttcttt 

accccgcctc 

ctcctccttc 

ctaccactac 

tggtcgtaac 

ggatgttgtc 

ctcttacttg 

tcttacaggg 

tttcgactat 

cttcgtcggt 

gagttactgg 

ggagccggag 

tctgtcgtgt 

gtctaggcca 

gagagggacg 

gacgaagtct 

tcctagtcaa 

ctcgaggaag 

ccccgactga 

cgacggctgg 

ctctagagac 

gtagttttta 

gagtaccacc 

ttctttctat 

gtcggatttt 

acaaagtact 

tctcgagtag 

gcgagctcag 

cggtcggtag 

ggtgacagga 



ctctagaggg 
ttcggtcata 
aattcgatgt 
cgcaaaacgc 
atcaataatt 
gcaatgtatt 
ctgcagttat 
tacccacctg 
ttcatgcggg 
gtactggaat 
gtaccactac 
taaaggttca 
cctgaaaggt 
tgccaccctc 
cgaatagctt 
ttcgaagggg 
cctgccgagc 
tctaacggta 
taagtctagg 
cggttcttca 
ggtaggtctt 
aagggatggc 
cgtttaagaa 
agcggtttct 
ggacgcctta 
cactgttaca 
gtctgtgtct 
ttcaccgact 
tccatctcct 
ggtggtggtg 
gtcggtcatg 
tacgggtaaa 
tccagtactc 
tcttccgtca 
tgctctctgt 
cggcggcgga 
cagtgcacaa 
gggatttcgt 
gggtccaata 
agatgttgca 
ttctcgtttt 
tgcctttgct 
ggcacttacc 
gacacggtcg 
ctcctgactg 
ttcacttaga 
accacaagaa 
cgccacaaca 
agtacctcgg 
gacgaacatg 
gttttcggaa 
tcttgtcagt 
atctcccggg 
acaacaaacg 
aaggattatt 



ctaggggata 
gacgagggac 
tgttccgttc 
gacgaagcgc 
atcattagtt 
gaatgccatt 
tactgcatac 
ataaatgcca 
ggataactgc 
accctgaaag 
gccaaaaccg 
gaggtggggt 
tttacagcat 
cagatatatt 
taattatgct 
cgcgtcccag 
ccgcgacctc 
caagacaccg 
tagtccctgg 
gatgggactt 
gaccacgttc 
gacgaatcaa 
tgtggtcctc 
ctgtacgtca 
actgttcaag 
cctaagacga 
gatacgtcta 
ccaccttctt 
tctccgactc 
gtggtggtgg 
gggactacgg 
ggtctttcgg 
tcttaccctt 
ataggtcgta 
cgtcgaccac 
ccgggacctc 
gttatacgat 
aaagctcgta 
ctgtgtggag 
cggacgtcac 
gataagtcta 
acgagagtac 
tctcaagtcg 
gttgtgtctt 
gtgagctggt 
tctacgtctt 
acgtcttcta 
gtatcgctgt 
tcatctagga 
gttaacgata 
tccgtagagg 
ctgagtagtt 
caaatttggg 
gggagggggc 
ttactccttt 



c cage t gaga 
gaacacacaa 
cgaactggct 
tacatgcccg 
aatgccccag 
tacegggegg 
aagggtatca 
tttgacgggt 
agttactgcc 
gatgaaccgt 
tcatgtagtt 
aactgeagtt 
tgttgaggcg 
cgtctcgaga 
gagtgatatc 
cgctacgacg 
catgggtgac 
tctgacttgt 
ttttggacgt 
gaegtctagt 
gccccggcgt 
ccactcaaat 
tcctacctac 
ctcttctcat 
gctccccatc 
ctacgcctcc 
ccctcacttc 
cttcttcggc 
cttgggatgc 
tgtctcagac 
caactgttca 
tttctctccg 
ctccgtcttg 
aaggtcctct 
ctctgtgtgt 
ttgatgtagt 
ttcttcatac 
cacgcgtacc 
gcaeactaaa 
cggctcctct 
ctgcagaacc 
ggtagaaact 
gaectgetag 
ttgettcaac 
ccaagaccca 
aaggctgtac 
cacccaagtt 
cactagcagt 
tctgatctcg 
acatttttca 
ataccgtcct 
cgaagagata 
cgactagtcg 
aeggaaggaa 
aacgtagcgt 



gtcatgttag 
cctccagcga 
gttaaegtae 
gtctatatgc 
taatcaagta 
accgactggc 
ttgcggttat 
gaacegtcat 
atttaceggg 
catgtagatg 
acccgcacct 
accctcaaac 
gggtaactgc 
gaccgattga 
cctctgggtt 
ggccaaaccg 
taccattacg 
acgtgtactt 
aactatggtt 
ggttacacca 
tegtcaegtt 
attcactacg 
aaacgctttg 
ggttgaacgt 
tcaaacacac 
tcctactgag 
tgtttcatca 
tactactget 
ttcttcggtg 
accttctcca 
tagagctctg 
aactceggtt 
cagttcgttt 
ttcaccttag 
accggtctca 
ggcgagacgt 
aggegegtet 
acctagggtt 
tactcgegta 
aagtcctact 
ggttgtacta 
ggctttgett 
aggteggcac 
teggacaact 
actgtttata 
tgagtcctat 
tgtttccacg 
agtggaacca 
ggaccttcgt 
caacgaaagt 
tcttcgcctc 
gtttegtcat 
gagctgacac 
ctgggacctt 
aacagactca 



acgagactac 
ctcatcacgc 
ttcttagacg 
gcaactgtaa 
tegggtatat 
gggttgctgg 
ccctgaaagg 
gtagttcaca 
eggacegtaa 
cataatcagt 
atcgccaaac 
aaaaccgtgg 
gtttacccgc 
tctcttgggt 
cgaccgatcg 
tgacgaggac 
accggacgac 
acaggtctta 
ccttccgtag 
tcttcggttg 
ctgggtaggg 
ggaagagcaa 
agtagaagtg 
actgatgecg 
aacgggtgac 
cctacagacc 
tettcategt 
cctgctactc 
tctctcttgg 
ccaagctcaa 
tggaccccta 
cgtggctctc 
ettgaaegga 
aaaccttgtc 
ccttcggtac 
ccgacaagga 
tgtcttcctg 
ettteggega 
cttagtcaga 
tcaactactc 
atcacttggt 
ttggtggcac 
cgtaagaaaa 
aegggeggga 
gttctgcctc 
acttcaagta 
ttagtaacct 
ctacgacttc 
aggtccttca 
aacggttcaa 
tgtegctget 
tcatccgccg 
ggaagatcaa 
ecaeggtgag 
tccacagtaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 
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gataagaccc cccaccccac cccgtcctgt cgttccccct cctaaccctt ctgttatcgt 3360 
ccgtacgacc cctacgccac ccgagatacc gaagactccg cctttcttgg tcgaccccga 3420 
gatcccccat aggggtgcgc gggacatcgc cgcgtaattc gcgccgccca caccaccaat 3480 
gcgcgtcgca ctggcgatgt gaacggtcgc gggatcgcgg gcgaggaaag cgaaagaagg 3540 
gaaggaaaga gcggtgcaag cggccgaaag gggcagttcg agatttagcc ccgtagggaa 3600 
atcccaaggc taaatcacga aatgccgtgg agctggggtt ttttgaacta atcccactac 3660 
caagtgcatc acccggtagc gggactatct gccaaaaagc gggaaactgc aacctcaggt 3720 
gcaagaaatt atcacctgag aacaaggttt gaccttgttg tgagttggga tagagccaga 3780 
taagaaaact aaatattccc taaaacccct aaagccggat aaccaatttt ttactcgact 3840 
aaattgtttt taaattgcgc ttaattaaga caccttacac acagtcaatc ccacaccttt 3900 
caggggtccg aggggtccgt ccgtcttcat acgtttcgta cgtagagtta atcagtcgtt 3960 
ggtccacacc tttcaggggt ccgaggggtc gtccgtcttc atacgtttcg tacgtagagt 4020 
taatcagtcg ttggtatcag ggcggggatt gaggcgggta gggcggggat tgaggcgggt 4080 
caaggcgggt aagaggcggg gtaccgactg attaaaaaaa ataaatacgt ctccggctcc 4140 
ggcggagacg gagactcgat aaggtcttca tcactcctcc gaaaaaacct ccggatccga 4200 
aaacgttttt cgagggccct cgaacatata ggtaaaagcc tagactagtc gtgcacaact 4260 
gttaattagt agccgtatca tatagccgta tcatattatg ctgttccact ccttgatttg 4320 
gtaccggttc aactggtcac ggcaaggcca cgagtggcgc gcgctgcagc ggcctcgcca 4380 
gctcaagacc tggctggccg agcccaagag ggccctgaag cacctcctgc tgaagcggcc 4440 
acaccaggcc ctgctgcact gggacaagta gtcgcgccag gtcctggtcc accacggcct 4500 
gttgtgggac cggacccaca cccacgcgcc ggacctgctc gacatgcggc tcaccagcct 4560 
ccagcacagg tgcttgaagg ccctgcggag gcccggccgg tactggctct agccgctcgt 462 0 
cggcaccccc gccctcaagc gggacgcgct gggccggccg ttgacgcacg tgaagcaccg 4680 
gctcctcgtc ctgactgtgc acgatgctct aaagctaagg tggcggcgga agatactttc 4740 
caacccgaag ccttagcaaa aggccctgcg gccgacctac taggaggtcg cgcccctaga 4800 
gtacgacctc aagaagcggg tggggttgaa caaataacgt cgaatattac caatgtttat 4860 
ttcgttatcg tagtgtttaa agtgtttatt tcgtaaaaaa agtgacgtaa gatcaacacc 4920 
aaacaggttt gagtagttac atagaatagt acagacatat ggcagctgga gatcgatctc 4980 
gaaccgcatt agtaccagta tcgacaaagg acacacttta acaataggcg agtgttaagg 5040 
tgtgttgtat gctcggcctt cgtatttcac atttcggacc ccacggatta ctcactcgat 5100 
tgagtgtaat taacgcaacg cgagtgacgg gcgaaaggtc agccctttgg acagcacggt 5160 
cgacgtaatt acttagccgg ttgcgcgccc ctctccgcca aacgcataac ccgcgagaag 5220 
gcgaaggagc gagtgactga gcgacgcgag ccagcaagcc gacgccgctc gccatagtcg 5280 
agtgagtttc cgccattatg ccaataggtg tcttagtccc ctattgcgtc ctttcttgta 5340 
cactcgtttt ccggtcgttt tccggtcctt ggcatttttc cggcgcaacg accgcaaaaa 5400 
ggtatccgag gcggggggac tgctcgtagt gtttttagct gcgagttcag tctccaccgc 5460 
tttgggctgt cctgatattt ctatggtccg caaaggggga ccttcgaggg agcacgcgag 5520 
aggacaaggc tgggacggcg aatggcctat ggacaggcgg aaagagggaa gcccttcgca 5580 
ccgcgaaaga gttacgagtg cgacatccat agagtcaagc cacatccagc aagcgaggtt 5640 
cgacccgaca cacgtgcttg gggggcaagt cgggctggcg acgcggaata ggccattgat 5700 
agcagaactc aggttgggcc attctgtgct gaatagcggt gaccgtcgtc ggtgaccatt 5760 
gtcctaatcg tctcgctcca tacatccgcc acgatgtctc aagaacttca ccaccggatt 5820 
gatgccgatg tgatcttcct gtcataaacc atagacgcga gacgacttcg gtcaatggaa 5880 
gcctttttct caaccatcga gaactaggcc gtttgtttgg tggcgaccat cgccaccaaa 5940 
aaaacaaacg ttcgtcgtct aatgcgcgtc tttttttcct agagttcttc taggaaacta 6000 
gaaaagatgc cccagactgc gagtcacctt gcttttgagt gcaattccct aaaaccagta 6060 
ctctaatagt ttttcctaga agtggatcta ggaaaattta atttttactt caaaatttag 6120 
ttagatttca tatatactca tttgaaccag actgtcaatg gttacgaatt agtcactccg 6180 
tggatagagt cgctagacag ataaagcaag taggtatcaa cggacfcgagg ggcagcacat 6240 
ctattgatgc tatgccctcc cgaatggtag accggggtca cgacgttact atggcgctct 6300 
gggtgcgagt ggccgaggtc taaatagtcg ttatttggtc ggtcggcctt cccggctcgc 6360 
gtcttcacca ggacgttgaa ataggcggag gtaggtcaga taattaacaa cggcccttcg 6420 
atctcattca tcaagcggtc aattatcaaa cgcgttgcaa caacggtaac gatgtccgta 6480 
gcaccacagt gcgagcagca aaccataccg aagtaagtcg aggccaaggg ttgctagttc 6540 
cgctcaatgt actagggggt acaacacgtt ttttcgccaa tcgaggaagc caggaggcta 6600 
gcaacagtct tcattcaacc ggcgtcacaa tagtgagtac caataccgtc gtgacgtatt 6660 
aagagaatga cagtacggta ggcattctac gaaaagacac tgaccactca tgagttggtt 6720 
cagtaagact cttatcacat acgccgctgg ctcaacgaga acgggccgca gttatgccct 6780 
attatggcgc ggtgtatcgt cttgaaattt tcacgagtag taaccttttg caagaagccc 6840 
cgcttttgag agttcctaga atggcgacaa ctctaggtca agctacattg ggtgagcacg 6900 
tgggttgact agaagtcgta gaaaatgaaa gtggtcgcaa agacccactc gtttttgtcc 6960 
ttccgtttta cggcgttttt tcccttattc ccgctgtgcc tttacaactt atgagtatga 7020 
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gaaggaaaaa gttataataa cttcgtaaat agtcccaata acagagtact cgcctatgta 7080 
taaacttaca taaatctttt tatttgttta tccccaaggc gcgtgtaaag gggcttttca 7140 
cggtggactg cag 7153 



<210> 23 
<211> 2352 
<212> DNA 

<213> fusion protein - human 
<400> 23 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 
cccactgatg gtaatgctgg cc tgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 
acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 
cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 
ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 360 
gagtttataa gtgatgccct tctcgttcct gacaagtgca aattcttaca ccaggagagg 420 
atggatgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 
aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagttccga 540 
ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 
gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 
ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 840 
gagtctgtgg aagaggtggt tcgagttcct acaacagcag ccagtacccc tgatgccgtt 900 
gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca gaaagccaaa 960 
gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 
gcagaacgtc aagcaaagaa cttgcctaaa gctgataaga aggcagttat ccagcatttc 1080 
caggagaaag tggaatcttt ggaacaggaa gcagccaacg agagacagca gctggtggag 1140 
acacacatgg ccagagtgga agccatgctc aatgaccgcc gccgcctggc cctggagaac 1200 
tacatcaccg ctctgcaggc tgttcctcct cggcctcgtc acgtgttcaa tatgctaaag 1260 
aagtatgtcc gcgcagaaca gaaggacaga cagcacaccc taaagcattt cgagcatgtg 1320 
cgcatggtgg atcccaagaa agccgctcag atccggtccc aggttatgac acacctccgt 1380 
gtgatttatg agcgcatgaa tcagtctctc tccctgctct acaacgtgcc tgcagtggcc 1440 
gaggagattc aggatgaagt tgatgagctg cttcagaaag agcaaaacta ttcagatgac 1500 
gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 
tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 
gacgatctcc agccgtggca ttcttttggg gctgactctg tgccagccaa cacagaaaac 1680 
gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 
tctgggttga caaatatcaa gacggaggag atctctgaag tgaactttga agtggaattc 1800 
cgacatgact caggatatga agttcatcat caagtattgg tgttctttgc agaagatgtg 1860 
ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
atcgtcatca ccttggtgat gctgaagaag aaaaagcttg gtaccgagct cggatccact 1980 
agtccagtgt ggtggaattc tgcagatatc ctggagccag tagatcctag actagagccc 2040 
tggaagcatc caggaagtca gcctaaaact gcttgtacca attgctattg taaaaagtgt 2100 
tgctttcatt gccaagtttg tttcatgaca aaagccttag gcatctccta tggcaggaag 2160 
aagcggagac agcgacgaag agctcatcag aacagtcaga ctcatcaagc ttctctatca 2220 
aagcagagga tatccagcac agtggcggcc gcagacgccg ctgtcacccc agaggagcgc 2280 
cacctgtcca agatgcagca gaacggctac gaaaatccaa cctacaagtt ctttgagcag 2340 
atgcagaact ag 2352 

<210> 24 
<211> 783 
<212> PRT 

<213> fusion protein - human 
<400> 24 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 

15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
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20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 

35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 

50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 

100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe He Ser Asp Ala Leu Leu 

115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 

130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 

180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 

195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 

210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 

260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 

275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 

290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 

325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 

340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 

355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 

370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 

405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 

420 425 ~ 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 

435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 

450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 

485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 

500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
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515 






520 










525 








Val 


Glu 


Leu 


Leu 


Pro Val Asn 


Gly Glu 


Phe 


Ser 


Leu Asp 


Asp 


Leu 


Gin 




530 






535 










540 










Pro 


Trp 


His 


Ser 


Phe Gly Ala 


Asp 


Ser 


Val 


Pro 


Ala 


Asn 


Thr 


Glu 


Asn 


545 








550 








555 










560 


Glu 


Val 


Glu 


Pro Val Asp Ala 


Arg 


Pro 


Ala Ala Asp Arg 


Gly 


Leu 


Thr 










565 






570 










575 




Thr 


Arg 


Pro 


Gly Ser Gly Leu 


Thr 


Asn 


He 


Lys 


Thr 


Glu 


Glu 


He 


Ser 








580 






585 










590 






Glu 


Val 


Asn 


Phe 


Glu Val Glu 


Phe 


Arg 


His 


Asp 


Ser Gly 


Tyr 


Glu 


Val 






595 






600 










605 








His 


His 


Gin 


Val 


Leu Val Phe 


Phe 


Ala 


Glu Asp Val 


Gly 


Ser 


Asn 


Lys 




610 






615 










620 










Gly 


Ala 


He 


He 


Gly Leu Met 


Val 


Gly 


Gly Val Val 


He 


Ala 


Thr 


Val 


625 








630 








635 










640 


He 


Val 


He 


Thr 


Leu Val Met 


Leu Lys 


Lys 


Lys 


Lys 


Leu 


Gly 


Thr 


Glu 










645 






650 










655 




Leu 


Gly Ser 


Thr 


Ser Pro Val 


Trp 


Trp 


Asn 


Ser 


Ala 


Asp 


He 


Leu 


Glu 








660 






665 










670 






Pro 


Val 


Asp 
675 


Pro 


Arg Leu Glu 


Pro 
680 


Trp 


Lys 


His 


Pro 


Gly 
685 


Ser 


Gin 


Pro 


Lys 


Thr 
690 


Ala 


Cys 


Thr Asn Cys 
695 


Tyr 


Cys 


Lys 


Lys 


Cys 
700 


Cys 


Phe 


His 


Cys 


Gin 


Val 


Cys 


Phe 


Met Thr Lys 


Ala 


Leu 


Gly He 


Ser 


Tyr 


Gly 


Arg 


Lys 


705 








710 








715 










720 


Lys 


Arg Arg 


Gin Arg Arg Arg 


Ala 


His 


Gin 


Asn 


Ser 


Gin 


Thr 


His 


Gin 










725 






730 










735 




Ala 


Ser 


Leu 


Ser 
740 


Lys Gin Arg 


He 


Ser 
745 


Ser 


Thr 


Val 


Ala 


Ala 
750 


Ala 


Asp 


Ala 


Ala 


Val 


Thr 


Pro Glu Glu 


Arg 


His 


Leu 


Ser Lys 


Met 


Gin 


Gin 


Asn 






755 






760 










765 








Gly 


Tyr Glu 


Asn Pro Thr Tyr 


Lys 


Phe 


Phe 


Glu 


Gin 


Met 


Gin 


Asn 






770 






775 










780 











<210> 25 
<211> 2823 
<212> DNA 

<213> fusion protein - human 
<400> 25 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc 
cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt 

ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc 

acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta 

cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg 

ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg 

gagtttataa gtgatgccct tctcgttcct gacaagtgca aattcttaca 

atggatgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac 

aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga 

ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga 

gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta 

agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt 

gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga 

ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac 

gagtctgtgg aagaggtggt tcgagttcct acaacagcag ccagtacccc 

gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca 

gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaga 

gcagaacgtc aagcaaagaa cttgcctaaa gctgataaga aggcagttat 

caggagaaag tggaatcttt ggaacaggaa gcagccaacg agagacagca 

acacacatgg ccagagtgga agccatgctc aatgaccgcc gccgcctggc 

tacatcaccg ctctgcaggc tgttcctcct cggcctcgtc acgtgttcaa 
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gctggaggta 60 
ctgtggcaga 120 
agggaccaaa 180 
ccctgaactg 240 
gtgcaagcgg 300 
cttagttggt 3 60 
ccaggagagg 420 
atgcagtgag 480 
caagttccga 540 
ttctgctgat 600 
tgcagatggg 660 
ggaagaagaa 720 
ggctgaggaa 780 
caccaccaca 840 
tgatgccgtt 900 
gaaagccaaa 960 
atgggaagag 1020 
ccagcatttc 1080 
gctggtggag 1140 
cctggagaac 1200 
tatgctaaag 1260 
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aagtatgtcc 
cgcatggtgg 
gtgatttatg 
gaggagattc 
gtcttggcca 
.tctttgaccg 
gacgatctcc 
gaagttgagc 
tctgggttga 
cgacatgact 
ggttcaaaca 
atcgtcatca 
agtccagtgt 
gatatttgcc 
ctgaagaaca 
gcacatctga 
tttcctcgag 
ttgttaacag 
gcttcagtgg 
tcatcggaag 
gatctggccc 
gtggcgatgg 
gattccccgg 
gccgacttcg 
gatatccagc 
aagatgcagc 
tag 



gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
ggtggaattc 
gacttaaaaa 
actgggagtg 
cagaagtgga 
aagaccttga 
gattatttgt 
agactgatat 
agagtagtaa 
ccccgaccga 
cgcatgccga 
gtccgggatt 
agtttgagca 
acagtggcgg 
agaacggcta 



gaaggacaga 
agccgctcag 
tcagtctctc 
t gat gage tg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gaeggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
tgeagatate 
gctcaagtgc 
tcgctactct 
atcaaggcta 
catgattttg 
acaagataat 
gcctctaaca 
caaaggtcaa 
tgtcagcctg 
cgcgctagac 
taccccccac 
gatgtttacc 
ccgcgacgcc 
cgaaaatcca 



cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caagtattgg 
atggtgggcg 
aaaaagcttg 
aagctactgt 
tccaaagaaa 
cccaaaacca 
gaaagactgg 
aaaatggatt 
gtgaataaag 
ttgagacagc 
agacagttga 
ggggacgagc 
gatttcgatc 
gactccgccc 
gatgeccttg 
gctgtcaccc 
acctacaagt 



taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaactttga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 
cttctatcga 
aaccgaagtg 
aaaggtctcc 
aacagctatt 
ctttacagga 
atgccgtcac 
atagaataag 
ctgtatcggg 
tccacttaga 
tggacatgtt 
cctacggcgc 
gaattgacga 
cagaggagcg 
tctttgagca 



cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
agtggaattc 
agaagatgtg 
agegacagtg 
cggatccact 
acaagcatgc 
cgccaagtgt 
gctgactagg 
tctactgatt 
tataaaagca 
agatagattg 
tgegacatea 
aattcccggg 
eggegaggae 
gggggacggg 
tctggatatg 
gtacggtggg 
ccacctgtcc 
gatgeagaac 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2823 



<210> 26 
<211> 941 
<212> PRT 

<213> fusion protein - human 
<400> 26 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala 

15 10 
Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly 

20 25 
Gin He Ala Met Phe Cys Gly Arg Leu Asn Met 

35 40 
Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr 

50 55 
Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu 
65 70 75 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro 

85 90 
Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr 

100 105 
He Pro Tyr Arg Cys Leu Val Gly Glu Phe He 

115 120 
Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu 

130 135 
Glu Thr His Leu His Trp His Thr Val Ala Lys 
145 150 155 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu 

165 170 
Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys 

180 185 
Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu 

195 200 
Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp 
210 215 



Ala Trp Thr 

Leu Leu Ala 
30 

His Met Asn 
45 

Lys Thr Cys 
60 

Val Tyr Pro 

Val Thr He 

His Pro His 
110 

Ser Asp Ala 

125 
Arg Met Asp 
140 

Glu Thr Cys 

Leu Pro Cys 

Pro Leu Ala 
190 

Asp Asp Ser 

205 
Gly Ser Glu 
220 



Ala Arg 
15 

Glu Pro 

Val Gin 

He Asp 

Glu Leu 
80 

Gin Asn 
95 

Phe Val 

Leu Leu 

Val Cys 

Ser Glu 
160 
Gly He 
175 

Glu Glu 
Asp Val 
Asp Lys 
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vai 


vai giu vai 


Aia 


GIU GIU GIU Glu vai Aia 


Giu vai 


GlU 


GlU 


GlU 


ZZj 






9 "3 n o "3 c: 








240 


blU 


Ala Asp Asp 


Asp 


Glu Asp Asp Glu Asp Gly 


Asp Glu 


vai 


Glu 


Glu 






245 


25U 






0 c c 

255 




GlU 


Ala Glu Glu 


Pro 


Tyr Glu Glu Ala Thr Glu 


Arg Thr 


Thr 


Ser 


He 




2oU 




o c c 
265 




OTA 
2 / U 






Aia 


Thr Thr Thr 


Thr 


Thr Thr Thr Glu Ser Vai 


GlU GlU 


vai 


Vai 


Arg 




275 




280 


285 






vai 


Pro Thr Thr 


Ala 


Ala Ser Thr Pro Asp Ala 


Vai Asp 


Lys 


Tyr 


Leu 




o o a 

29 u 




A A C 

295 


0 n n 
300 








Glu 


Thr Pro Gly 


Asp 


Glu Asn Glu His Ala His 


Phe Gin 


Lys 


Ala 


Lys 


*a n r 

ODD 






■Jin "3 1 C 

31U 315 








"3 O A 

320 


Glu 


Arg Leu Glu 


Ala 


Lys His Arg Glu Arg Met 


Ser Gin 


Vai 


Met 


Arg 






325 


330 






335 




Glu 


Trp Glu Glu 


Ala 


Glu Arg Gin Ala Lys Asn 


Leu Pro 


Lys 


Ala 


Asp 




i a r\ 
34U 




345 




*3 C A 

350 




Lys 


Lys Ala Vai 


Tl -v 

He 


Gin His Pne Gin Glu Lys 


Vai Glu 


Ser 


Leu 


Glu 




o c n 
j J J 




360 


365 








Gill 


Glu Ala Ala 


Asn 


Glu Arg Gin Gin Leu Vai 


Glu Thr 


His 


Met 


Ala 




370 




n "~1 c 

375 


380 








Arg 


Vai Glu Ala 


Met 


Leu Asn Asp Arg Arg Arg 


Leu Ala 


Leu 


Glu 


Asn 


•3 O C 






0 a a ■joc 

390 395 








400 


Tyr 


lie Thr Ala 


Leu 


Gin Ala Vai Pro Pro Arg 


Pro Arg 


His 


Vai 


Phe 






A A C 

405 


A 1 A 

410 






415 




As n 


Met Leu Lys 


Lys 


Tyr Vai Arg Ala Glu Gin 


Lys Asp 


Arg 


Gin 


His 




420 




425 




430 






Thx 


Leu Lys His 


Phe 


Glu His Vai Arg Met Vai 


Asp Pro 


Lys 


Lys 


Ala 




/on 
435 




440 


445 








Ala 


Gin lie Arg 


Ser 


Gin Vai Met Tnr His Leu 


Arg Vai 


He 


Tyr 


Glu 




/ C A 

450 




A C C 

455 


460 








Arg 


Met Asn Gin 


Ser 


Leu Ser Leu Leu Tyr Asn 


Vai Pro 


Ala 


Vai 


Ala 


465 






a n n a 1 c 

470 475 








480 


Glu 


Glu lie Gin 


Asp 


Glu Vai Asp Glu Leu Leu 


Gin Lys 


Glu 


Gin 


Asn 






485 


490 






495 




Tyr 


Ser Asp Asp 


Vai 


Leu Ala Asn Met He Ser 


Glu Pro 


Arg 


He 


Ser 




500 




505 




510 






Tyr 


Gly Asn Asp 


Ala 


Leu Met Pro Ser Leu Thr 


Glu Thr 


Lys 


Thr 


Thr 




515 




520 


525 








vai 


Glu Leu Leu 


Pro 


Vai Asn Gly Glu Pne Ser 


Leu Asp 


Asp 


Leu 


Gin 




53 U 




535 


C A A 

D4U 






Pro 


Trp His Ser 


Pne 


Gly Ala Asp Ser Vai Pro 


Ala Asn 


Thr 


Glu 


Asn 


c / c 
545 






55U 555 








C C A 

560 


GlU 


vai Giu Pro 


vai 


Asp Ala Arg Pro Ala Ala 


Asp Arg 


Gly 


Leu 


Thr 








5 /U 






575 




rnr 


Arg Pro Gly 


Ser 


Gly Leu Thr Asn He Lys 


Tnr Glu 


Glu 


He 


Ser 




con 




c 0 c 
5o5 




con 






GlU 


Vai Asn Pne 


GIU 


Vai Glu Pne Arg His Asp 


Ser Gly 


Tyr 


Glu 


Vai 




595 




600 


605 








His 


His Gin Vai 


Leu 


Vai Phe Phe Ala Glu Asp 


Vai Gly 


Ser 


Asn 


Lys 




610 




615 


620 






Gly 


Ala He He 


Gly 


Leu Met Vai Gly Gly Vai 


Vai He 


Ala 


Thr 


Vai 


625 






630 635 








640 


lie 


Vai He Thr 


Leu 


Vai Met Leu Lys Lys Lys 


Lys Leu 


Gly 


Thr 


Glu 






645 


650 






655 




Leu 


Gly Ser Thr 


Ser 


Pro Vai Trp Trp Asn Ser 


Ala Asp 


He 


Lys 


Leu 




660 




665 




670 






Leu 


Ser Ser He 


Glu 


Gin Ala Cys Asp He Cys 


Arg Leu 


Lys 


Lys 


Leu 




675 




680 


685 








Lys 


Cys Ser Lys 


Glu 


Lys Pro Lys Cys Ala Lys 


Cys Leu 


Lys 


Asn 


Asn 




690 




695 


700 








Trp 


Glu Cys Arg 


Tyr 


Ser Pro Lys Thr Lys Arg 


Ser Pro 


Leu 


Thr 


Arg 



705 



710 



715 



720 
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Ala 


His 


Leu Tnr Glu vai biu t>er Arg j_»eu biu 


Arg 


Leu 


bXU 


OJ.Il 


Leu 






T)c Tin 








/ o j 




Pne 


Leu 


Leu lie File Fro Arg oiu Asp Lieu Asp 






Leu 


ijy o 


jyie u 






/4U / 43 






/ 3U 






Asp 


Ser 


Leu Gin Asp lie Lys Ala Leu Leu Thr 


Lily 


Leu 


rfie 


Val 


oin 




755 /bU 




/ D J 








Asp 


Asn 


Val Asn Lys Asp Ala Val Thr Asp Arg 


Leu 


Aia 


Ser 


Vai 


biU 


770 


775 


/oU 










Thr 


Asp 


Met Pro Leu Thr Leu Arg Gin His Arg 


lie 


Ser 


Ala 


Thr 


Ser 


785 


/yu /yo 










pnn 

QUU 


Ser 


Ser 


Glu Glu Ser Ser Asn Lys Gly Gin Arg 


Gin 


Leu 


Tnr 


vai 


Ser 






805 810 








Q1 C 

ol5 




Gly 


lie 


Pro Gly Asp Leu Ala Pro Pro Thr Asp 


vai 


Ser 


Leu 


oiy 


Asp 




820 825 






oJU 






Glu 


Leu 


His Leu Asp oiy valu Asp vai Aia Met 


Ala 


alb 




Asp 


Ala 






835 840 




845 








Leu 


Asp 


Asp Phe Asp Leu Asp Met Leu Gly Asp 


Gly 


Asp 


Ser 


Pro 


Gly 




850 


855 


860 










Pro 


Gly 


Phe Thr Pro His Asp Ser Ala Pro Tyr 


Gly 


Ala 


Leu 


Asp 


Met 


865 


870 875 










880 


Ala 


Asp 


Phe Glu Phe Glu Gin Met Phe Thr Asp 


Ala 


Leu 


Gly 


He 


Asp 




885 890 








895 




Glu 


Tyr 


Gly Gly Asp He Gin His Ser Gly Ala 


Ala 


Ala 


Asp 


Ala 


Ala 




900 905 






910 






Val 


Thr 


Pro Glu Glu Arg His Leu Ser Lys Met 


Gin 


Gin 


Asn 


Gly 


Tyr 






915 920 




925 








Glu 


Asn 


Pro Thr Tyr Lys Phe Phe Glu Gin Met 


Gin 


Asn 










930 


935 


940 











<210> 27 
<211> 5015 
<212> DNA 
<213> human 



<400> 27 

gacggatcgg 

ccgcatagtt 

cgagcaaaat 

ttagggttag 

gattattgac 

tggagttccg 

cccgcccatt 

attgacgtca 

atcatatgcc 

atgcccagta 

tcgctattac 

actcacgggg 

aaaatcaacg 

gtaggcgtgt 

ctgcttactg 

gtttaaactt 

agatatccag 

gcctcgactg 

ttgaccctgg 

cattgtctga 

gaggattggg 

gcggaaagaa 

agcgcggcgg 

cccgctcctt 

gctctaaatc 

aaaaaacttg 



gagatctccc 
aagccagtat 
ttaagctaca 
gcgttttgcg 
tagttattaa 
cgttacataa 
gacgtcaata 
atgggtggac 
aagtacgccc 
catgacctta 
catggtgatg 
atttccaagt 
ggactttcca 
acggtgggag 
gcttatcgaa 
aagcttggta 
cacagtggcg 
tgccttctag 
aaggtgccac 
gtaggtgtca 
aagacaatag 
ccagctgggg 
gtgtggtggt 
tcgctttctt 
ggggcatccc 
attagggtga 



gatcccctat 
ctgctccctg 
acaaggcaag 
ctgcttcgcg 
tagtaatcaa 
cttacggtaa 
atgacgtatg 
tatttacggt 
cctattgacg 
tgggactttc 
cggttttggc 
ctccacccca 
aaatgtcgta 
gtctatataa 
attaatacga 
ccgagctcgg 
gccgctcgag 
ttgccagcca 
tcccactgtc 
ttctattctg 
caggcatgct 
ctctaggggg 
tacgcgcagc 
cccttccttt 
tttagggttc 
tggttcacgt 



ggtcgactct 
cttgtgtgtt 
gcttgaccga 
atgtacgggc 
ttacggggtc 
atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 
gcagagctct 
ctcactatag 
atccactagt 
tctagagggc 
tctgttgttt 
ctttcctaat 
gggggtgggg 
ggggatgcgg 
tatccccacg 
gtgaccgcta 
ctcgccacgt 
cgatttagtg 
agtgggccat 



cagtacaatc 
ggaggtcgct 
caattgcatg 
cagatatacg 
attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 
ctggctaact 
ggagacccaa 
ccagtgtggt 
ccgtttaaac 
gcccctcccc 
aaaatgagga 
tggggcagga 
tgggctctat 
cgccctgtag 
cacttgccag 
tcgccggctt 
ctttacggca 
cgccctgata 



tgctctgatg 
gagtagtgcg 
aagaatctgc 
cgttgacatt 
agcccatata 
cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 
caaatgggcg 
agagaaccca 
gctggctagc 
ggaattctgc 
ccgctgatca 
cgtgccttcc 
aattgcatcg 
cagcaagggg 
ggcttctgag 
cggcgcatta 
cgccctagcg 
tccccgtcaa 
cctcgacccc 
gacggttttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca aactggaaca 1620 

acactcaacc ctatctcggt ctattctttt gatttataag ggattttggg gatttcggcc 1680 

tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattaatt ctgtggaatg 1740 

tgtgtcagtt agggtgtgga aagtccccag gctccccagg caggcagaag tatgcaaagc 1800 

atgcatctca attagtcagc aaccaggtgt ggaaagtccc caggctcccc agcaggcaga 1860 

agtatgcaaa gcatgcatct caattagtca gcaaccatag tcccgcccct aactccgccc 1920 

atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt 1980 

tttatttatg cagaggccga ggccgcctct gcctctgagc tattccagaa gtagtgagga 2040 

ggcttttttg gaggcctagg cttttgcaaa aagctcccgg gagcttgtat atccattttc 2100 

ggatctgatc agcacgtgtt gacaattaat catcggcata gtatatcggc atagtataat 2160 

acgacaaggt gaggaactaa accatggcca agttgaccag tgccgttccg gtgctcaccg 2220 

cgcgcgacgt cgccggagcg gtcgagttct ggaccgaccg gctcgggttc tcccgggact 2280 

tcgtggagga cgacttcgcc ggtgtggtcc gggacgacgt gaccctgttc atcagcgcgg 2340 

tccaggacca ggtggtgccg gacaacaccc tggcctgggt gtgggtgcgc ggcctggacg 2400 

agctgtacgc cgagtggtcg gaggtcgtgt ccacgaactt ccgggacgcc tccgggccgg 2460 

ccatgaccga gatcggcgag cagccgtggg ggcgggagtt cgccctgcgc gacccggccg 2520 

gcaactgcgt gcacttcgtg gccgaggagc aggactgaca cgtgctacga gatttcgatt 2580 

ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac gccggctgga 2640 

tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccccaac ttgtttattg 2700 

cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt 27 60 

tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catgtctgta 2820 

taccgtcgac ctctagctag agcttggcgt aatcatggtc atagctgttt cctgtgtgaa 2880 

attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct 2940 

ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg cccgctttcc 3 000 

agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg 3 060 

gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 3120 

ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 3180 

gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 3240 

aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc 33 00 

gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 33 60 

ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 3420 

cctttctccc ttcgggaagc gtggcgcttt ctcaatgctc acgctgtagg tatctcagtt 3480 

cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 3540 

gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 3600 

cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag 3660 

agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg 3720 

ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 3780 

ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 3840 

gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact 3900 

cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa 3960 

attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt 4020 

accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag 4080 

ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca 4140 

gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc 4200 

agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt 4260 

ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg 4320 

ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca 43 80 

gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg 4440 

ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca 4500 

tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg 4560 

tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct 4620 

cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca 4680 

tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca 4740 

gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg 4800 

tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac 4860 

ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt 4920 

attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc 4980 

cgcgcacatt tccccgaaaa gtgccacctg acgtc 5015 
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<211> 5015 
<212> DNA 
<213> human 

<400> 28 

ctgcctagcc ctctagaggg ctaggggata ccagctgaga gtcatgttag acgagactac 60 

ggcgtatcaa ttcggtcata gacgagggac gaacacacaa cctccagcga ctcatcacgc 120 

gctcgtttta aattcgatgt tgttccgttc cgaactggct gttaacgtac ttcttagacg 180 

aatcccaatc cgcaaaacgc gacgaagcgc tacatgcccg gtctatatgc gcaactgtaa 240 

ctaataactg atcaataatt atcattagtt aatgccccag taatcaagta tcgggtatat 300 

acctcaaggc gcaatgtatt gaatgccatt taccgggcgg accgactggc gggttgctgg 3 60 

gggcgggtaa ctgcagttat tactgcatac aagggtatca ttgcggttat ccctgaaagg 420 

taactgcagt tacccacctg ataaatgcca tttgacgggt gaaccgtcat gtagttcaca 480 

tagtatacgg ttcatgcggg ggataactgc agttactgcc atttaccggg cggaccgtaa 540 

tacgggtcat gtactggaat accctgaaag gatgaaccgt catgtagatg cataatcagt 600 

agcgataatg gtaccactac gccaaaaccg tcatgtagtt acccgcacct atcgccaaac 660 

tgagtgcccc taaaggttca gaggtggggt aactgcagtt accctcaaac aaaaccgtgg 720 

ttttagttgc cctgaaaggt tttacagcat tgttgaggcg gggtaactgc gtttacccgc 780 

catccgcaca tgccaccctc cagatatatt cgtctcgaga gaccgattga tctcttgggt 840 

gacgaatgac cgaatagctt taattatgct gagtgatatc cc tctgggtt cgaccgatcg 900 

caaatttgaa ttcgaaccat ggctcgagcc taggtgatca ggtcacacca ccttaagacg 960 

tctataggtc gtgtcaccgc cggcgagctc agatctcccg ggcaaatttg ggcgactagt 1020 

cggagctgac acggaagatc aacggtcggt agacaacaaa cggggagggg gcacggaagg 1080 

aactgggacc ttccacggtg agggtgacag gaaaggatta ttttactcct ttaacgtagc 1140 

gtaacagact catccacagt aagataagac cccccacccc accccgtcct gtcgttcccc 1200 

ctcctaaccc ttctgttatc gtccgtacga cccctacgcc acccgagata ccgaagactc 1260 

cgcctttctt ggtcgacccc gagatccccc ataggggtgc gcgggacatc gccgcgtaat 1320 

tcgcgccgcc cacaccacca atgcgcgtcg cactggcgat gtgaacggtc gcgggatcgc 1380 

gggcgaggaa agcgaaagaa gggaaggaaa gagcggtgca agcggccgaa aggggcagtt 1440 

cgagatttag ccccgtaggg aaatcccaag gctaaatcac gaaatgccgt ggagctgggg 1500 

ttttttgaac taatcccact accaagtgca tcacccggta gcgggactat ctgccaaaaa 1560 

gcgggaaact gcaacctcag gtgcaagaaa ttatcacctg agaacaaggt ttgaccttgt 1620 

tgtgagttgg gatagagcca gataagaaaa ctaaatattc cctaaaaccc ctaaagccgg 1680 

ataaccaatt ttttactcga ctaaattgtt tttaaattgc gcttaattaa gacaccttac 1740 

acacagtcaa tcccacacct ttcaggggtc cgaggggtcc gtccgtcttc atacgtttcg 1800 

tacgtagagt taatcagtcg ttggtccaca cctttcaggg gtccgagggg tcgtccgtct 1860 

tcatacgttt cgtacgtaga gttaatcagt cgttggtatc agggcgggga ttgaggcggg 1920 

tagggcgggg attgaggcgg gtcaaggcgg gtaagaggcg gggtaccgac tgattaaaaa 1980 

aaataaatac gtctccggct ccggcggaga cggagactcg ataaggtctt catcactcct 2040 

ccgaaaaaac ctccggatcc gaaaacgttt ttcgagggcc ctcgaacata taggtaaaag 2100 

cctagactag tcgtgcacaa ctgttaatta gtagccgtat catatagccg tatcatatta 2160 

tgctgttcca ctccttgatt tggtaccggt tcaactggtc acggcaaggc cacgagtggc 2220 

gcgcgctgca gcggcctcgc cagctcaaga cctggctggc cgagcccaag agggccctga 2280 

agcacctcct gctgaagcgg ccacaccagg ccctgctgca ctgggacaag tagtcgcgcc 2340 

aggtcctggt ccaccacggc ctgttgtggg accggaccca cacccacgcg ccggacctgc 2400 

tcgacatgcg gctcaccagc ctccagcaca ggtgcttgaa ggccctgcgg aggcccggcc 2460 

ggtactggct ctagccgctc gtcggcaccc ccgccctcaa gcgggacgcg ctgggccggc 2520 

cgttgacgca cgtgaagcac cggctcctcg tcctgactgt gcacgatgct ctaaagctaa 2580 

ggtggcggcg gaagatactt tccaacccga agccttagca aaaggccctg cggccgacct 2640 

actaggaggt cgcgccccta gagtacgacc tcaagaagcg ggtggggttg aacaaataac 2700 

gtcgaatatt accaatgttt atttcgttat cgtagtgttt aaagtgttta tttcgtaaaa 2760 

aaagtgacgt aagatcaaca ccaaacaggt ttgagtagtt acatagaata gtacagacat 2820 

atggcagctg gagatcgatc tcgaaccgca ttagtaccag tatcgacaaa ggacacactt 2880 

taacaatagg cgagtgttaa ggtgtgttgt atgctcggcc ttcgtatttc acatttcgga 2940 

ccccacggat tactcactcg attgagtgta attaacgcaa cgcgagtgac gggcgaaagg 3000 

tcagcccttt ggacagcacg gtcgacgtaa ttacttagcc ggttgcgcgc ccctctccgc 3 060 

caaacgcata acccgcgaga aggcgaagga gcgagtgact gagcgacgcg agccagcaag 312 0 

ccgacgccgc tcgccatagt cgagtgagtt tccgccatta tgccaatagg tgtcttagtc 3180 

ccctattgcg tcctttcttg tacactcgtt ttccggtcgt tttccggtcc ttggcatttt 3240 

tccggcgcaa cgaccgcaaa aaggtatccg aggcgggggg actgctcgta gtgtttttag 3300 

ctgcgagttc agtctccacc gctttgggct gtcctgatat ttctatggtc cgcaaagggg 3360 

gaccttcgag ggagcacgcg agaggacaag gctgggacgg cgaatggcct atggacaggc 3420 



-39- 



21040 

WO 03/072041 



PCTAJS03/05458 



ggaaagaggg aagcccttcg caccgcgaaa 

gccacatcca gcaagcgagg ttcgacccga 

cgacgcggaa taggccattg atagcagaac 

gtgaccgtcg tcggtgacca ttgtcctaat 

tcaagaactt caccaccgga ttgatgccga 

gagacgactt cggtcaatgg aagccttttt 

ggtggcgacc atcgccacca aaaaaacaaa 

ctagagttct tctaggaaac tagaaaagat 

gtgcaattcc ctaaaaccag tactctaata 

taatttttac ttcaaaattt agttagattt 

tggttacgaa ttagtcactc cgtggataga 

aacggactga ggggcagcac atctattgat 

cacgacgtta ctatggcgct ctgggtgcga 

tcggtcggcc ttcccggctc gcgtcttcac 

gataattaac aacggccctt cgatctcatt 

aacaacggta acgatgtccg tagcaccaca 

cgaggccaag ggttgctagt tccgctcaat 

aatcgaggaa gccaggaggc tagcaacagt 

accaataccg tcgtgacgta ttaagagaat 

actgaccact catgagttgg ttcagtaaga 

gaacgggccg cagttatgcc ctattatggc 

agtaaccttt tgcaagaagc cccgcttttg 

caagctacat tgggtgagca cgtgggttga 

aaagacccac tcgtttttgt ccttccgttt 

cctttacaac ttatgagtat gagaaggaaa 

taacagagta ctcgcctatg tataaactta 

gcgcgtgtaa aggggctttt cacggtggac 



gagttacgag tgcgacatcc atagagtcaa 3480 
cacacgtgct tggggggcaa gtcgggctgg 3540 
tcaggttggg ccattctgtg ctgaatagcg 3600 
cgtctcgctc catacatccg ccacgatgtc 3660 
tgtgatcttc ctgtcataaa ccatagacgc 372 0 
ctcaaccatc gagaactagg ccgtttgttt 3780 
cgttcgtcgt ctaatgcgcg tctttttttc 3840 
gccccagact gcgagtcacc ttgcttttga 3900 
gtttttccta gaagtggatc taggaaaatt 3960 
catatatact catttgaacc agactgtcaa 4020 
gtcgctagac agataaagca agtaggtatc 4080 
gctatgccct cccgaatggt agaccggggt 4140 
gtggccgagg tctaaatagt cgttatttgg 4200 
caggacgttg aaataggcgg aggtaggtca 4260 
catcaagcgg tcaattatca aacgcgttgc 4320 
gtgcgagcag caaaccatac cgaagtaagt 4380 
gtactagggg gtacaacacg ttttttcgcc 4440 
cttcattcaa ccggcgtcac aatagtgagt 4500 
gacagtacgg taggcattct acgaaaagac 4560 
ctcttatcac atacgccgct ggctcaacga 4620 
gcggtgtatc gtcttgaaat tttcacgagt 4680 
agagttccta gaatggcgac aactctaggt 4740 
ctagaagtcg tagaaaatga aagtggtcgc 4800 
tacggcgttt tttcccttat tcccgctgtg 4860 
aagttataat aacttcgtaa atagtcccaa 4920 
cataaatctt tttatttgtt tatccccaag 4980 
tgcag 5015 



<210> 29 
<211> 2352 
<212> DNA 

<213> fusion protein - human 
<400> 29 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 

cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 

ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 

acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 

cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 3 00 

ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 3 60 

gagtttataa gtgatgccct tctcgttcct gacaagtgca aattcttaca ccaggagagg 420 

atggatgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 

aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagttccga 540 

ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 

gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 

agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 

gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 

ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 840 

gagtctgtgg aagaggtggt tcgagttcct acaacagcag ccagtacccc tgatgccgtt 900 

gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca gaaagccaaa 960 

gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 

gcagaacgtc aagcaaagaa cttgcctaaa gctgataaga aggcagttat ccagcatttc 1080 

caggagaaag tggaatcttt ggaacaggaa gcagccaacg agagacagca gctggtggag 1140 

acacacatgg ccagagtgga agccatgctc aatgaccgcc gccgcctggc cctggagaac 1200 

tacatcaccg ctctgcaggc tgttcctcct cggcctcgtc acgtgttcaa tatgctaaag 1260 

aagtatgtcc gcgcagaaca gaaggacaga cagcacaccc taaagcattt cgagcatgtg 1320 

cgcatggtgg atcccaagaa agccgctcag atccggtccc aggttatgac acacctccgt 1380 

gtgatttatg agcgcatgaa tcagtctctc tccctgctct acaacgtgcc tgcagtggcc 1440 

gaggagattc aggatgaagt tgatgagctg cttcagaaag agcaaaacta ttcagatgac 1500 

gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 
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tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 
gacgatctcc agccgtggca ttcttttggg gctgactctg tgccagccaa cacagaaaac 1680 
gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 
tctgggttga caaatatcaa gacggaggag atctctgaag tgaactttga agtggaattc 1800 
cgacatgact caggatatga agttcatcat caaaaattgg tgttctttgc agaagatgtg 1860 
ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
atcgtcatca ccttggtgat gctgaagaag aaaaagcttg gtaccgagct cggatccact 1980 
agtccagtgt ggtggaattc tgcagatatc ctggagccag tagatcctag actagagccc 2040 
tggaagcatc caggaagtca gcctaaaact gcttgtacca attgctattg taaaaagtgt 2100 
tgctttcatt gccaagtttg tttcatgaca aaagccttag gcatctccta tggcaggaag 2160 
aagcggagac agcgacgaag agctcatcag aacagtcaga ctcatcaagc ttctctatca 2220 
aagcagagga tatccagcac agtggcggcc gcagacgccg ctgtcacccc agaggagcgc 2280 
cacctgtcca agatgcagca gaacggctac gaaaatccaa cctacaagtt ctttgagcag 2340 
atgcagaact ag 2352 

<210> 30 
<211> 783 
<212> PRT 

<213> fusion protein - human 



<400> 30 
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Gin 


He 
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Met 
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Leu 


Asn 


Met 
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Met 


Asn 
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Gin 
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Thr 


Lys 


Glu 
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He 


Leu 
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Gin 
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Tyr 
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Glu 


Leu 
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Gin 


He 


Thr 


Asn 


Val 
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Glu 
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Gin 


Pro 
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Thr 


He 


Gin 
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Trp 


Cys 


Lys 
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Lys 


Gin 


Cys 


Lys 


Thr 


His 


Pro 


His 
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He 
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Asp 


Ala 


Leu 
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Phe 


Leu 
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Gin 
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Leu 
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Trp 
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Thr 
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Thr 
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Glu 
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Lys 
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He 
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Lys 


Phe 
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Val 


Glu 


Phe 
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Pro 
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Glu 
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Val 
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200 










205 








Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thr 


Asp 


Tyr 


Ala 


Asp 


Gly 


Ser 


Glu 


Asp 


Lys 


210 










215 










220 










Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu 


Glu 


Glu 


Val 


Ala 


Glu 


Val 


Glu 


Glu 


Glu 


225 










230 










235 










240 


Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp 


Asp 


Glu 


Asp 


Gly 


Asp 


Glu 


Val 


Glu 


Glu 






245 










250 










255 




Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu 


Glu 


Ala 


Thr 


Glu 


Arg 


Thr 


Thr 


Ser 


He 








260 










265 










270 






Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Glu 


Ser 


Val 


Glu 


Glu 


Val 


Val 


Arg 






275 










280 










285 








Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser 


Thr 


Pro 


Asp 


Ala 


Val 


Asp 


Lys 


Tyr 


Leu 




290 










295 










300 










Glu 


Thr 


Pro 


Gly 


Asp 


GlU 


Asn 


Glu 


His 


Ala 


His 


Phe 


Gin 


Lys 


Ala 


Lys 


305 








310 










315 










320 


Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 


Glu 


Arg 


Met 


Ser 


Gin 


Val 


Met 


Arg 
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325 



330 



335 



Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 

340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 

355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 

370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 

405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 

420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 

435 440 445 

Ala Gin lie Arg Ser Gin Val Met Thr His Leu Arg Val lie Tyr Glu 

450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu lie Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 

485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met lie Ser Glu Pro Arg He Ser 

500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 

515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 

530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 

565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 

580 585 590 

Glu Val Asn Phe Glu Val Glu Phe Arg His Asp Ser Gly Tyr Glu Val 

595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 

610 615 620 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Lys Leu Gly Thr Glu 

645 650 655 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp He Leu Glu 

660 665 670 

Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gin Pro 

675 680 685 

Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe His Cys 

690 695 700 

Gin Val Cys Phe Met Thr Lys Ala Leu Gly He Ser Tyr Gly Arg Lys 
705 710 715 720 

Lys Arg Arg Gin Arg Arg Arg Ala His Gin Asn Ser Gin Thr His Gin 

725 730 735 

Ala Ser Leu Ser Lys Gin Arg He Ser Ser Thr Val Ala Ala Ala Asp 

740 745 750 

Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gin Gin Asn 

755 760 765 

Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
770 775 780 



<210> 31 
<211> 2823 
<212> DNA 
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<213> fusion protein - human 



<400> 31 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 

ggttcaaaca 

atcgtcatca 

agtccagtgt 

gatatttgcc 

ctgaagaaca 

gcacatctga 

tttcctcgag 

ttgttaacag 

gcttcagtgg 

tcatcggaag 

gatctggccc 

gtggcgatgg 

gattccccgg 

gccgacttcg 

gatatccagc 

aagatgcagc 

tag 



gtttggcact 

gtaatgctgg 

acatgaatgt 

ataccaagga 

atgtggtaga 

agtgcaagac 

gtgatgccct 

gcgaaactca 

acttgcatga 

ttgtgtgttg 

atgactcgga 

aagtagtaga 

atgacgagga 

aagccacaga 

aagaggtggt 

tcgagacacc 

aggccaagca 

aagcaaagaa 

tggaatcttt 

ccagagtgga 

ctctgcaggc 

gcgcagaaca 

atcccaagaa 

agcgcatgaa 

aggatgaagt 

acatgattag 

aaacgaaaac 

agccgtggca 

ctgttgatgc 

caaatatcaa 

caggatatga 

aaggtgcaat 

ccttggtgat 

ggtggaattc 

gacttaaaaa 

actgggagtg 

cagaagtgga 

aagaccttga 

gattatttgt 

agactgatat 

agagtagtaa 

ccccgaccga 

cgcatgccga 

gtccgggatt 

agtttgagca 

acagtggcgg 

agaacggcta 



gctcctgctg 

cctgctggct 

ccagaatggg 

aggcatcctg 

agccaaccaa 

ccatccccac 

tctcgttcct 

tcttcactgg 

ctacggcatg 

cccactggct 

tgtctggtgg 

agtagcagag 

cgatgaggat 

gagaaccacc 

tcgagttcct 

tggggatgag 

ccgagagaga 

cttgcctaaa 

ggaacaggaa 

agccatgctc 

tgttcctcct 

gaaggacaga 

agccgctcag 

tcagtctctc 

tgatgagctg 

tgaaccaagg 

caccgtggag 

ttcttttggg 

ccgccctgct 

gacggaggag 

agttcatcat 

cattggactc 

gctgaagaag 

tgcagatatc 

gctcaagtgc 

tcgctactct 

atcaaggcta 

catgattttg 

acaagataat 

gcctctaaca 

caaaggtcaa 

tgtcagcctg 

cgcgctagac 

taccccccac 

gatgtttacc 

ccgcgacgcc 

cgaaaatcca 



gccgcctgga 

gaaccccaga 

aagtgggatt 

cagtattgcc 

ccagtgacca 

tttgtgattc 

gacaagtgca 

cacaccgtcg 

ttgctgccct 

gaagaaagtg 

ggcggagcag 

gaggaagaag 

ggtgatgagg 

agcattgcca 

acaacagcag 

aatgaacatg 

atgtcccagg 

gctgataaga 

gcagccaacg 

aatgaccgcc 

cggcctcgtc 

cagcacaccc 

atccggtccc 

tccctgctct 

cttcagaaag 

atcagttacg 

ctccttcccg 

gctgactctg 

gccgaccgag 

atctctgaag 

caaaaattgg 

atggtgggcg 

aaaaagcttg 

aagctactgt 

tccaaagaaa 

cccaaaacca 

gaaagactgg 

aaaatggatt 

gtgaataaag 

ttgagacagc. 

agacagttga 

ggggacgagc 

gatttcgatc 

gactccgccc 

gatgcccttg 

gctgtcaccc 

acctacaagt 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaactttga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 
cttctatcga 
aaccgaagtg 
aaaggtctcc 
aacagctatt 
ctttacagga 
atgccgtcac 
atagaataag 
ctgtatcggg 
tccacttaga 
tggacatgtt 
cctacggcgc 
gaattgacga 
cagaggagcg 
tctttgagca 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
agtggaattc 
agaagatgtg 
agcgacagtg 
cggatccact 
acaagcatgc 
cgccaagtgt 
gctgactagg 
tctactgatt 
tataaaagca 
agatagattg 
tgcgacatca 
aattcccggg 
cggcgaggac 
gggggacggg 
tctggatatg 
gtacggtggg 
ccacctgtcc 
gatgcagaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2823 



<210> 32 
<211> 941 
<212> PRT 

<213> fusion protein - human 
<400> 32 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 

15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 
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Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 

35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 

50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 

100 105 HO 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe lie Ser Asp Ala Leu Leu 

115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 

130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 

180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 

195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 

210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 

260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 

275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 

290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 

325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 

340 345 .350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 

355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 

370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr He Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 

405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 

420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 

435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 

450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 

485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 

500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 
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Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 

530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 

565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn lie Lys Thr Glu Glu lie Ser 

580 585 590 

Glu Val Asn Phe Glu Val Glu Phe Arg His Asp Ser Gly Tyr Glu Val 

595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 

610 615 620 

Gly Ala lie lie Gly Leu Met Val Gly Gly Val Val lie Ala Thr Val 
625 630 635 640 

lie Val lie Thr Leu Val Met Leu Lys Lys Lys Lys Leu Gly Thr Glu 

645 650 655 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp lie Lys Leu 

660 665 670 

Leu Ser Ser He Glu Gin Ala Cys Asp He Cys Arg Leu Lys Lys Leu 

675 680 685 

Lys Cys Ser Lys Glu Lys Pro Lys Cys Ala Lys Cys Leu Lys Asn Asn 

690 695 700 

Trp Glu Cys Arg Tyr Ser Pro Lys Thr Lys Arg Ser Pro Leu Thr Arg 
705 710 715 720 

Ala His Leu Thr Glu Val Glu Ser Arg Leu Glu Arg Leu Glu Gin Leu 

725 730 735 

Phe Leu Leu He Phe Pro Arg Glu Asp Leu Asp Met He Leu Lys Met 

740 745 750 

Asp Ser Leu Gin Asp He Lys Ala Leu Leu Thr Gly Leu Phe Val Gin 

755 760 765 

Asp Asn Val Asn Lys Asp Ala Val Thr Asp Arg Leu Ala Ser Val Glu 

770 775 780 

Thr Asp Met Pro Leu Thr Leu Arg Gin His Arg He Ser Ala Thr Ser 
785 790 795 800 

Ser Ser Glu Glu Ser Ser Asn Lys Gly Gin Arg Gin Leu Thr Val Ser 

805 810 815 

Gly He Pro Gly Asp Leu Ala Pro Pro Thr Asp Val Ser Leu Gly Asp 

820 825 830 

Glu Leu His Leu Asp Gly Glu Asp Val Ala Met Ala His Ala Asp Ala 

835 840 845 

Leu Asp Asp Phe Asp Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly 

850 855 860 

Pro Gly Phe Thr Pro His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met 
865 870 875 880 

Ala Asp Phe Glu Phe Glu Gin Met Phe Thr Asp Ala Leu Gly He Asp 

885 890 895 

Glu Tyr Gly Gly Asp He Gin His Ser Gly Ala Ala Ala Asp Ala Ala 

900 905 910 

Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gin Gin Asn Gly Tyr 

915 920 925 

Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
930 935 940 



<210> 33 
<211> 63 
<212> PRT 
<213> human 



Glu Glu He Ser Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser 



<400> 33 



1 



5 



10 



15 
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Gly Tyr Glu Val His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val 

20 25 30 

Gly Ser Asn Lys Gly Ala He He Gly Leu Met Val Gly Gly Val Val 

35 40 45 

He Ala Thr Val He Val He Thr Leu Val Met Leu Lys Lys Lys 
50 55 60 



<210> 34 
<211> 4 
<212> PRT 
<213> human 

<400> 34 
Lys Met Asp Ala 
1 



<210> 35 
<211> 7 
<212> PRT 
<213> human 

<400> 35 

Gly Val Val He Ala Thr Val 
1 5 



<210> 36 
<211> 63 
<212> PRT 
<213> human 

<400> 36 

Glu Glu He Ser Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser 

15 10 15 

Gly Tyr Glu Val His His Gin Val Leu Val Phe Phe Ala Glu Asp Val 

20 25 30 

Gly Ser Asn Lys Gly Ala He He Gly Leu Met Val Gly Gly Val Val 

35 40 45 

He Ala Thr Val He Val He Thr Leu Val Met Leu Lys Lys Lys 
50 55 60 



<210> 37 
<211> 63 
<212> PRT 
<213> human 

<400> 37 

Glu Glu He Ser Glu Val Asn Leu Asp Ala Glu Phe Arg His Asp Ser 

15 10 15 

Gly Tyr Glu Val His His Gin Val Leu Val Phe Phe Ala Glu Asp Val 

20 25 30 

Gly Ser Asn Lys Gly Ala He He Gly Leu Met Val Gly Gly Val Val 

35 40 45 

He Ala Thr Val He Val He Thr Leu Val Met Leu Lys Lys Lys 
50 55 60 



<210> 38 
<211> 4 
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<212> PRT 
<213> human 



<400> 38 
Asn Leu Asp Ala 
1 



<210> 39 
<211> 63 
<212> PRT 
<213> human 

<400> 39 

Glu Glu lie Ser Glu Val Asn Phe Glu Val Glu Phe Arg His Asp Ser 

15 10 15 

Gly Tyr Glu Val His His Gin Val Leu Val Phe Phe Ala Glu Asp Val 

20 25 30 

Gly Ser Asn Lys Gly Ala lie He Gly Leu Met Val Gly Gly Val Val 

35 40 45 

He Ala Thr Val He Val He Thr Leu Val Met Leu Lys Lys Lys 
50 55 60 



<210> 40 
<211> 4 
<212> PRT 
<213> human 

<400> 40 
Asn Phe Glu Val 
1 



<210> 41 
<211> 5 
<212> PRT 
<213> human 

<400> 41 

Val Asn Phe Ala Val 
1 5 



<210> 42 
<211> 5 
<212> PRT 
<213> human 

<400> 42 

Val Lys Val Asp Ala 
1 5 



<210> 43 
<211> 5 
<212> PRT 
<213> human 

<400> 43 

Trp Lys Met Asp Ala 
1 5 
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<210> 44 
<211> 5 
<212> PRT 
<213> human 

<400> 44 

Val Lys Ala Asp Ala 
1 5 



<210> 45 
<211> 5 
<212> PRT 
<213> human 

<400> 45 

Val Lys Lys Asp Ala 
1 5 



<210> 46 
<211> 5 
<212> PRT 
<213> human 

<400> 46 

Val Lys Glu Asp Ala 
1 5 



<210> 47 
<211> 5 
<212> PRT 
<213> human 

<400> 47 

Val Lys He Asp Ala 
1 5 



<210> 48 
<211> 5 
<212> PRT 
<213> human 

<400> 48 

Val Lys Met He Ala 
1 5 



<210> 49 
<211> 5 
<212> PRT 
<213> human 

<400> 49 

Val Lys Met Asn Ala 
1 5 
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<210> 50 
<211> 5 
<212> PRT 
<213> human 

<400> 50 

Val Lys Met Glu Ala 
1 5 



<210> 51 
<211> 5 
<212> PRT 
<213> human 

<400> 51 

Val Lys Met Asp Glu 
1 5 



<210> 52 
<211> 5 
<212> PRT 
<213> human 

<400> 52 

Val Lys Met Asp Lys 
1 5 



<210> 53 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 53 

ggagaggata tcatggagcc agtagat 27 

<210> 54 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 



<210> 55 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 55 

ggatgtgata tctttcttct tcagcatcac caagg 35 



<400> 54 

tacatggcgg ccgcctactt actgctttg 



29 
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<210> 56 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 56 

ctgagatatc aagctactgt cttctatcga acaagc 36 

<210> 57 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 58 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 58 

ctgctgtggc ggccgcctag ttctgcatct gctc 34 

<210> 59 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 60 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 60 

atggtgtggc ggccgcagac gccgctgtca cc 32 

<210> 61 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



<400> 57 

gcgcgatatc cccaccgtac tcgtcaattc c 



31 



<400> 59 

tgccccgcgc ggccgcgcga tgctgcccgg 



30 



<220> 
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<223> PCR Primer 
<400> 61 

agcgcacaag cttccccgcg cagggtcgcg atgctg 36 

<210> 62 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 62 

ggatgtaagc tttttcttct tcagcatcac caagg 35 

<210> 63 
<2U> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 63 

tgcagatatc ctggagccag tagatcctag ac 32 

<210> 64 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<400> 64 

gctggatatc ctctgctttg atagagaagc 



30 
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= (54) Title: ASSAYS TO MONITOR AMYLOID PRECURSOR PROTEIN PROCESSING 

aaa (57) Abstract: The present invention provides DNA constructs, genetically engineered host cells, and methods for identifying 
5§jgg inhibitors of amyloid precursor protein (APP) processing. The methods provide for the convenient identification, in a single assay, 

of inhibitors of p-secretase and y-secretase as well as other forms of APP processing. The methods rely on fusion proteins of APP 

^ and transcription factors in which APP processing releases the transcription factors, allowing the transcription factors to activate 

transcription of a reporter gene. Inhibitors are identified as substances that block or diminish transcription factor release from the 

fusion protein, thereby causing a diminution of reporter gene readout. 
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This application contains the following inventions or groups of inventions which are not so linked as to form a single general inventive 
concept under PCT Rule 13.1 . In order for all inventions to be examined, the appropriate additional examination fees must be paid. 
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